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ABSTRACT: Recent accounting scandals have resulted in regulatory Initiatives de- 
signed to strengthen audit committee oversight of corporate financial reporting and 
have led to a concern thai U.S. GAAP has become too rules-based. We examine 
issues related to these initlatives using two experiments. CFOs іп our experiments 
exhibit more agreement and are less likely to report aggressively under a less precise 
(more principles-based) standard than under a more precise (more rules-based) stan- 
dard. Our results also indicate that CFOs applying a more precise standard are less 
likely to report aggressively In the presence of a strong audit committee than a weak 
audit committee. We find no effect of audit committee strength when the standard is 
less precise. Finally, we find support for a three-path mediating model examining 
mechanisms driving the effect of standard precision on aggressive reporting decisions. 
These results should be of interest to U.S. policymakers as they continue to contem- 
plate a shift to more principles-based accounting standards (e.g., IFRS). 


Keywords: standard precision; rules-based standards; principles-based standards; 
audit commitiee; IFRS. 
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L INTRODUCTION 

ur study examines the effect of accounting standard precision on financial statement 
О preparers’ reporting judgments, as well as the potential role that the audit committee plays 

in mitigating aggressive financial reporting under differing levels of standard precision.’ 
A wave of corporate accounting scandals in recent years has led to a push for regulatory changes. 
As part of the Congressional plan to reform U.S. financial reporting, the Sarbanes-Oxley Act of 
2002 (SOX, U.S. House of Representatives 2002) introduced major regulatory initiatives in an 
attempt to overhaul financial reporting and corporate governance systems. These initiatives in- 
clude two remedies aimed at dampening aggressive financial reporting: (1) the potential adoption 
of principles-based accounting standards and (2) enhancement of public company audit commit- 
tees. 

Concern has been rising within the financial/investing community that U.S. accounting stan- 
dards have become too "rules-based." With bright-line tests and detailed guidance, U.S. standards 
have become so precise that many feel they invite opportunistic interpretation by corporate ex- 
ecutives. The perception that a significant number of executives have been concerned with meet- 
ing the letter of a rule, more so than its spirit, has led to a call to consider a more “principles- 
based" regime. Accordingly, SOX required the Securities and Exchange Commission (SEC) to 
conduct a study on the possible adoption of principles-based standards by the U.S. financial 
reporting system. Recently, the SEC moved the principles-versus-rules debate to the front burner 
by proposing a roadmap that could lead to the mandatory adoption of the more principles-based 
International Financial Reporting Standards (IFRS) by U.S. public companies (SEC 2008)? 

Additionally, the Financial Accounting Standards Board (FASB) issued a proposal for a 
principles-based approach to U.S. standard-setting and asked interested parties to comment (FASB 
2002). Many who responded were supportive of a principles-based model, believing that such an 
approach would lead to higher quality financial reporting with less opportunity to “exploit the gaps 
in GAAP" (CalPERS) or use “financial accounting engineering” to get around detailed, rules- 
based standards (PricewaterhouseCoopers).? Conversely, some argued that less specificity in ac- 
counting standards would result in an increase in manipulation of financial results (e.g., Intel and 
former FASB member David Mosso). On the separate issue of inter-firm comparability, a number 
of letter-writers expressed concern that, with an increased reliance on judgment, implementation 
of principles-based standards could result in a decrease in comparability across firms (e.g., IBM, 
Pfizer, Goldman Sachs, BDO Seidman). The ongoing debate over whether and when to move 
toward more principles-based accounting standards reflects the uncertainty of both the anticipated 
desirable and undesirable effects of such a paradigm shift. Although these are empirical questions 
that cau be addressed experimentally to inform policy makers prior to making a move to 
principles-based standards, the academic literature is limited with respect to research regarding 
these uncertainties (Maines et al. 2003). 

As noted, the possible adoption of principles-based financial reporting standards is only one 
of the regulatory remedies embodied in SOX to improve U.S. financial reporting. Section 301 of 
SOX also expands the responsibilities of the audit committee and places greater focus on its role 
in enhancing the quality of financial reporting. Section 407 requires public companies to disclose 


Similar to Hackenbrack and Nelson (1996), we define aggressive financial reporting as the favorable portrayal of a 
company's financial situation even when that reporting is not clearly indicated by the facts. 
2 On February 24, 2010, the SEC issued a statement reaffirming its support for global accounting standards while pushing 
back the earliest date for adoption of IFRS to 2015 (SEC 2010). 
For example, several organizations including Computer Sciences Corporation, PricewaterhouseCoopers, and Financial 
Executive International made comments indicating that a principles-based approach would result in transactions being 
accounted for according to their economic substance rather than their form (see the FASB website for these and other 
comment letters: http://Awww.fasb.org/jsp/FASB/CommentLetter_C/Comment_etterPage ?project_id = 1125-001). 
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whether at least one member of the audit committee is a financial expert. Archival studies have 
shown audit committee strength to have an effect on the financial reporting process in rules-based 
standards environments (Klein 2002; Bédard et al. 2004). However, evidence on its effect in more 
principles-oriented environments and on the interactive effect of standard precision and audit 
committee strength is lacking (Libby et al. 2002). 

To explore these issues, we conduct two experiments in which experienced U.S.-based finan- 
cial statement preparers are placed in a lease classification decision context. We manipulate stan- 
dard precision between participants, where more precise and less precise standards are based on 
FASB Statement No. 13 and International Accounting Standard (IAS) 17, respectively. Audit 
committee strength is manipulated between participants as either strong or weak. 

The results of experiment 1 indicate that financial statement preparers are less likely to report 
aggressively (1.е., they are more likely to capitalize the lease) when applying a less precise (more 
principles-based) lease classification criterion than when applying a more precise (more rules- 
based) criterion. This result provides support for the SEC’s and FASB’s hope that a move toward 
more principles-based standards could result in better, or less aggressive, financial reporting. 
Consistent with prior archival research, we also find that preparers asked to apply a more precise 
standard are less likely to report aggressively in the presence of a strong audit committee than in 
the presence of a weak audit committee. However, we find that the influence of a strong audit 
committee diminishes in the principles-based setting. Interestingly, we find significantly less vari- 
ability among preparers’ reporting decisions when a less precise standard is in place. This suggests 
that, contrary to the concern of some interested parties, the application of more principles-based 
standards need not result in less comparability than more precise standards. Experiment 2 confirms 
the findings of experiment 1 and finds support for а three-path mediation model. Specifically, we 
find that the 1е88 precise the standard, the more concerned preparers are about second-guessing and 
possible costs imposed through regulation and litigation. This, in turn, results in an increased 
desire to reflect the underlying economics of transactions and events in the financial statements 
and, ultimately, in less aggressive reporting. 

The findings of our study suggest a number of important implications. For example, these 
findings should be of interest to policy makers as they contemplate whether and when to adopt 
IFRS. Our results have encouraging implications for U.S. financial reporting for a move to a more 
principles-based accounting standards model. Also, our study provides insight regarding recent 
regulatory changes that address the role and composition of audit committees. Our results suggest 
that, under certain circumstances, principles-based standards can ease the burden on an audit 
committee (whether strong or weak) to curtail management’s aggressive reporting choices. 

Section П provides background information and develops our hypotheses. Sections Ш and IV 
describe the experiments used to test hypotheses and present the results. Section V provides a 
summary and offers conclusions, implications, and suggestions for future research. 


П. BACKGROUND AND HYPOTHESES 
Principles-Based versus Rules-Based Accounting Standards 


The Current Environment 

Much of the recent interest in IFRS is predicated on the notion that these standards are 
principles-based, whereas U.S. GAAP is described as (and often criticized for) being rules-based. 
Rules-based standards typically provide very detailed guidance with bright-line tests. A perceived 
benefit of more detailed implementation guidance is greater comparability of financial statements 
across companies (Schipper 2003). On the other hand, it has been suggested that rules-based 
standards lead to a “show me where it says I can’t” attitude, which, in turn, can lead to dysfunc- 
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tional financial reporting behavior (Weil 2002, 3). Excessively detailed reporting guidance can 
invite transaction structuring and incentive-consistent standard interpretation to achieve preferred 
accounting treatments (FASB 2002; Bockus et al. 2003; Nelson 2003). 

The FASB’s Statement of Financial Accounting Standards (SFAS) No. 13, Accounting for 
Leases, with its detailed list of criteria for lease classification containing several bright-line thresh- 
olds, is cited as the “poster child” for rules-based standards (Maines 2007, 360). In developing 
SFAS No. 13, the FASB had hoped that explicit lease classification rules would eliminate indi- 
vidual judgment, resulting in consistent application of the standard across firms (Shortridge and 
Myring 2004). However, because bright-line tests were established, companies have been able to 
structure and interpret lease contracts to avoid capitalization, which tends to present a more 
favorable picture of a company’s overall financial condition (Imhoff and Thomas 1988; Pulliam 
1988). As a result, “the explicit rule allows the off balance sheet financing to continue, and 
provides justification for the treatment” (Shortridge and Myring 2004, 3; emphasis added). 

Principles-based standards, which provide limited interpretive and implementation guidance, 
are the perceived solution to problems caused by rules-based standards. Less guidance, in theory, 
increases the need to apply professional judgment consistent with the intent of the standards. Both 

. the FASB (2002) and the SEC (2003) believe that this will ultimately result in more meaningful 
and informative financial statements.* 


Related Research 


Several experimental studies have explored auditor judgments in light of specific accounting 
standards or guidance (e.g., Gibbins et al. 2001; Ng and Tan 2003; Kadous et al. 2003). For 
example, Ng and Tan (2003) find that, in the presence of a weak audit committee, auditors are 
more likely to allow aggressive revenue recognition when no guidance/rule governs the transac- 
tion than when there is authoritative guidance specifying the appropriate reporting treatment. 
Trompeter (1994) finds that, when authoritative guidance limits the range of acceptable accounting 
treatments, audit partners are less influenced by client preferences. In an imprecise standard 
context, Hackenbrack and Nelson (1996) show that auditors approve reporting decisions that are 
consistent with their incentives and use the vagueness of the imprecise standard to justify their 
decisions. While these studies provide some evidence as to how auditors react to the existence or 
specificity of authoritative guidance, they assume, either implicitly or explicitly, that managers 
will always choose to report aggressively. This line of research does not speak to how standard 
precision influences the level of aggressiveness in reporting decisions of financial statement pre- 
parers. 

Three studies have employed an experimental approach to examine the influence of standard 
precision on reporting decisions made by financial statement preparers (Hoffman and Patton 2002; 
Psaros and Trotman 2004; Jamal and Tan 2010). Hoffman and Patton (2002) hypothesize that less 
precise standards for governmental financial reporting provide more latitude for federal financial 


4 Specifically, the FASB (2002) concludes that adopting a principles-based approach will result in greater judgment, 
leading to: accounting treatments that conform to the substance of a transaction, improved transparency, enhanced 
comparability, increased responsiveness to emerging accounting issues, and facilitation of international financial report- 
ing standard convergence. 

Cuccia et al. (1995) explore rule precision in a гах setting, finding that moving from a vague rule to a more precise rule 
has no effect on tax practitioners’ propensity to take an aggressive position on an individual's retum. When a vague 
standard exists, tax preparers use the vagueness in the standard to justify their aggressive tax position; when a precise 
standard exists, they use the vagueness in facts to justify their position. However, as the authors point out, results from 
their study of tax practitioners might not generalize to financial reporting decisions because of differences in penalties 
and professional responsibilities (Cuccia et aL 1995), as well as in the scope of impact (e.g., aggressive reporting in an 
individual tax setting affects a few people at most, limiting the financial/social impact, while aggressive financial 
reporting by publicly traded companies can have a much broader and deeper impact). 
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officers to reach reporting decisions that are aligned with their incentives. Contrary to expecta- 
tions, they find that replacing a less precise recognition threshold (“probable”) with a more precise 
threshold (“more likely than not”) does not affect the likelihood that these preparers will recognize 
a contingent loss. They speculate that the lack of an effect could be due to a weak manipulation, 
as participants did not appear to perceive the new threshold to be more precise than the original. 

Using two separate experiments involving a consolidation judgment, Psaros and Trotman 
(2004) consider whether corporate accountants will justify aggressive judgments by more aggres- 
sively interpreting case-specific information. Results of the first experiment, conducted in a 
“substance-over-form” (less precise) standard setting, suggest that accountants who make more 
aggressive judgments tend to interpret case-specific information more aggressively. They obtain 
similar results in the second experiment, which is set in a more precise standard environment. As 
the researchers did not include both levels of standard precision in one experiment, they are unable 
to provide direct evidence on the relative impact that the level of standard precision has on 
reporting behavior. However, by comparing results from the two experiments, they suggest that 
aggressive reporting is more likely in rules-based settings. Psaros and Trotman (2004) call for 
further research designed to more directly compare the relative influence of standards of differing 
precision on preparers’ reporting decisions. 

Jamal and Tan (2010) examine whether auditor type (e.g., principles-oriented or rules- 
oriented) affects financial managers’ reporting decisions under rules-based and principles-based 
standards. They find that auditor type has no effect on reporting decisions under a rules-based 
standard. However, under a principles-based standard, financial managers are less likely to report 
aggressively when the auditor is principles-oriented. They speculate that improved financial re- 
porting will result only if a move toward more principles-based standards is accompanied by a 
shift in auditors’ mindsets toward being more principles-oriented. This shift would appear likely to 
happen relatively quickly because auditors have a responsibility to assure that their clients are in 
compliance with prevailing standards, regardless of standard type. That is, if there is a move 
toward principle-based standards such as IFRS, a “rules-based” auditor would quickly have to 
adjust to an environment in which there are no bright-line thresholds on which to fall back. Thus, 
the cells from their study that are most relevant for purposes of our study are the two “matched” 
cells (1.е., principles-oriented auditor/principles-based standard and rules-oriented auditor/rules- 
based standard). Results from these two cells are not inconsistent with the notion that principles- 
based standards will typically result in less aggressive financial reporting by preparers. 


Standard Precision and Aggressive Financial Reporting 

Rules-based standards have developed over time, in part, because of demand for them by 
financial statement preparers and auditors, who believe that such detailed rules shield them from 
potential criticism for aggressive reporting (Benston et al. 2006). Aggressive reporting is likely to 
be more difficult to justify with a less precise standard in which a threshold is not explicitly stated 
(Maines 2007). The costs associated with being perceived to be out of compliance are similar 
under both more precise and less precise standards. However, the uncertainty surrounding the risk 
of being perceived to be out of compliance (i.e., second-guessed) is inherently greater without 
detailed guidance (Nelson et al. 2002). Thus, preparers applying less precise standards could 
decide not to select their desired accounting treatment because of the increased risk of second- 
guessing and the attendant costs. Theoretical research in the areas of law and economics supports 
the idea that less precise rules could result in less aggressive decisions (Calfee and Craswell 1984; 
Craswell and Calfee 1986). This outcome could be more likely in the post-SOX era because of 


5 This assumes a strong regulatory enforcement environment (e.g., the post-SOX regulatory environment). 


The Accounting Review May 2011 
| American Accounting Association 


752 Agogita, Doupnik, апа Tsakumis 


required CEO and CFO certification of the financial statements. Following this reasoning, along 
with a belief that “objectives-oriented standards allow accounting professionals to operationalize 
accounting treatments in a manner that best fulfills the objective of each standard and thereby best 
captures the underlying economic reality,” the SEC (2003, 11) and the FASB (2002) have indi- 
cated a preference for less precise standards. 

There is an implicit assumption underlying this belief that, when the constraints imposed by 
rules-based standards are removed, financial statement preparers will desire to make accounting 
choices that best reflect economic reality. However, the FASB (2002, 9) acknowledges that a 
principles-based approach "could lead to abuse, whereby the principles in accounting standards 
are not applied in good faith consistent with the intent and spirit of the standards." There are a 
number of environmental factors that could influence whether a principles-based standard regime 
leads to more meaningful financial reporting or to abuses of the less-precise guidance. For ex- 
ample, the level of regulatory scrutiny of accounting treatment choices, the degree of consensus 
among regulators about the appropriate treatment, and the ability of rules-based standards to 
constrain aggressive choices could influence the efficacy of a principles-based regime. 

We choose to examine a setting that best represents the concerns/complaints levied by inves- 
tors and regulators. Specifically, we examine a rules-based standard that has been unable to 
constrain aggressive behavior, even in a strong post-SOX regulatory environment in which there is 
concern about aggressive reporting choices." In such a setting, we expect that the level of standard 
precision will affect the extent to which financial statement preparers are concerned about second- 
guessing and possible costs imposed through regulation and litigation. Financial statement prepar- 
ers should be more concerned about second-guessing when standards are less precise (Nelson et 
al. 2002). This concern will result in an increased desire to reflect the underlying economics of 
transactions and events in the financial statements. Presumably, if preparers can show that their 
accounting choices fairly present economic reality, then they will be better able to defend them- 
selves when second-guessed by external parties (e.g., the SEC). Thus, we argue that less precise 
standards result in an increased concern about second-guessing and related costs, which in turn 
results in a greater interest in fair presentation. This reasoning leads to the following hypothesis: 


H1: Financial statement preparers applying a less precise financial reporting standard will be 
less likely to make an aggressive financial reporting decision than preparers applying a 
more precise standard. 


The Influence of the Audit Committee on Aggressive Reporting in Principles-Based and 
Rules-Based Settings 

The audit committee represents a key corporate governance mechanism, in that it is viewed as 
one of the most important monitors of the financial reporting process (Blue Ribbon Committee 
1999). However, a spate of high-profile corporate accounting scandals (e.g., Enron, WorldCom, 
Tyco) brought the effectiveness of audit committees into question. Regulators and investors have 
called for stronger audit committees to help ensure higher quality financial reporting. Congress 
responded by enacting SOX, which, among other things, expands the responsibilities of the audit 
committee and places a greater focus on its role in enhancing the financial reporting process. 
Under SOX, the audit committee now is responsible for hiring the company's independent auditor, 
overseeing the work of the auditor, resolving financial reporting differences between management 
and the auditor, and monitoring internal controls (Lander 2004). Concurrent with the expansion of 


7 This also represents a setting in which we would be likely to find a benefit of a principles-based approach, should such 
8 benefit exist. For example, if a shift to principles-based standards is accompanied by a reduction in the level of 
regulatory scrutiny, the potential abuses the FASB (2002) acknowledges are more likely to occur. 
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the audit committee’s role, there has been a push toward greater independence and expertise of 
audit committee members. SOX now requires all members of the audit committee to be indepen- 
dent (i.e., not currently part of management). Companies must now alsc disclose whether at least 
one member of the audit committee is a financial expert. Further, while there are no strict guide- 
lines regarding the frequency of audit committee meetings (the Blue Ribbon Committee [1999] 
recommends at least four meetings per year), some have called for audit committees to be more 
active and diligent in carrying out their oversight duties (DeZoort et al. 2002; Stewart and Munro 
2007). 

Thus, conventional wisdom is that audit committee independence, expertise, and activity can 
enhance the monitoring and oversight of management, leading to less aggressive financial report- 
ing choices. Prior research appears consistent with this notion (Bédard et al. 2004). Results of 
archival studies examining U.S.-listed firms suggest that independent members are less likely to be 
sanctioned for fraudulent or misleading financial reporting, and are negatively associated with 
abnormal accruals, aggressive earnings management, and earnings restatements (Abbott et al. 
2000; Klein 2002; Abbott et al. 2004; Bédard et al. 2004). Similarly, financial expertise on the 
audit committee is negatively associated with discretionary accruals and aggressive earnings man- 
agement and is viewed favorably by capital markets (Xie et al. 2003; Abbott et al. 2004; Bédard 
et al. 2004; DeFond et al. 2005). Further, the frequency of audit committee meetings is negatively 
related to financial reporting misstatements, earnings management, and perceived audit risk 
(Abbott et al. 2000; Stewart and Munro 2007). Thus, in the rules-based U.S. reporting environ- 
ment, a strong audit committee can serve as a monitoring mechanism that helps mitigate арргез- 
sive financial reporting behavior. 

However, it is less clear whether the same relationship exists, or is as strong, in environments 
that employ principles-based accounting standards. Assuming a relatively strong regulatory envi- 
ronment, preparers using principles-based standards are likely to perceive greater risk of regulator 
sanctions due to the inherent uncertainty of the standard. This uncertainty could lead to an in- 
creased desire to reflect the economic substance of a transaction, in turn lessening the audit 
committee’s burden to constrain aggressive reporting choices. Many countries using the less 
precise IFRS either do not require audit committees (e.g., Russia) or they are fairly recent devel- 
opments (e.g., the European Union) and, as a consequence, are likely weak relative to U.S. audit 
committees. Thus, there is no direct evidence regarding the relationship between audit committee 
strength and aggressive reporting in a principles-based setting, and what indirect evidence there is 
does not lead to clear conclusions regarding the existence or strength of the relationship (Collier 
and Gregory 1996; Peasnell et al. 2000; Chen and Cheng 2007). However, there are certain 
settings under which inferences could be drawn regarding the influence of audit committee 
strength in principles-based standard environments. 

Recall that, in our setting," we expect that preparers applying a more principles-based stan- 
dard will be more concerned about regulator second-guessing and capturing the economic sub- 
stance of the transaction than preparers applying our rules-based standard, irrespective of the 
strength of the audit committee (i.e., the НІ expectation). If this is the case, then preparers 
applying the less precise standard will be more likely than preparers applying the more precise 
standard to choose the treatment preferred by the audit committee, thereby lessening the burden on 
the audit committee to curb aggressive reporting. That is, the less precise standard should dampen 
pursuit of aggressive financial reporting regardless of audit committee strength, reducing the need 
for the audit committee to exert its influence regarding this reporting choice. We therefore test the 
following hypothesis: 


* Our setting is one in which the rules-based standard has been relatively ineffective at curtailing aggressive reporting, 
even in a strong post-SOX regulatory environment where there is concern about aggressive reporting choices. 
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Н2: The difference in financial reporting judgments made in the presence of a strong audit 
committee versus a weak audit committee will be greater when a financial reporting 
standard is more precise than when it is less precise. 


Ш. EXPERIMENT 1 
Participants and Design 

Participants in experiment 1 are 96 experienced financial statement preparers (55 CFOs, 23 
Controllers, and 18 VPs of Finance) with an average of 25.5 years of professional work 
experience.” As our experiment asks participants to assume the role of a company controller faced 
with a lease classification decision, it was important that we select experienced executives who are 
likely to play a key role in the financial reporting decisions of their companies. 

To test our hypotheses, we conduct an experiment requiring participants to make a lease 
classification decision in which the two variables of interest (standard precision and audit com- 
mittee strength) are manipulated at two levels. The more precise condition for standard precision 
includes a bright-line criterion for lease capitalization (lease term is “equal to 75% or more” of 
expected life) consistent with SFAS No. 13 (FASB 1976). The less precise condition includes a 
vaguer criterion (lease term is “for the major part” of expected life) based on IAS 17 (IASC 1997). 
In the strong audit committee condition, participants are told that all audit committee members are 
independent with no disclosed relationship with the company and all qualify as financial experts as 
defined by the SEC, and that the audit committee meets frequently (11 to 12 times per year). For 
the weak audit committee condition, while all committee members qualify as independent, par- 
ticipants are informed that one member has no disclosed prior relationship with the company (two 
are former officers of the company), one qualifies as a financial expezt, and the committee meets 
infrequently (two to three times per year). ® 


Procedures 

We provided participants with a research instrument containing four sections. Section I in- 
cluded guidelines for classifying a lease as either a capital or operating lease. Participants were 
told only one criterion (relating to the ratio of lease term to estimated economic life) was relevant 
to the lease classification decision. Specifically, one group of participants was instructed that a 
lease must be classified as a capital lease if the lease term is “equal to 75% or more” of the 





9 We mailed instruments to 1,000 individuals identified by the American Institute of Certified Public Accountants 
(AICPA) as financial executives. We received replies from 106 individuals and 63 were returned as undeliverable. The 
resulting response rate of 11.31 percent (106 responses divided by 937 delivered) is consistent with prior studies 
involving CFO/controller participants (e.g., Graham and Harvey 2001; Gibbins et al. 2007; Sanchez et al. 2007). There 
were ten unusable responses: seven instruments were completed by inappropriately classified individuals such as staff 
accountants, payroll clerks, and tax accountants (our conclusions remain the same with or without these individuals), 
and three were returned with no response on the dependent variable. Thus, 96 usable responses remained. Comparisons 
of early and late responders indicate no significant difference, suggesting that nonresponse bias does not drive our 
results. With respect to the demographics of our participants (e.g., current or prior experience working at a publicly 
traded company or as an external auditor, experience with leases), there were no significant differences between 
conditions and, when included in our analyses, the demographic variables were neither significant nor altered the 
conclusions we draw. 

The features present in both our strong and weak audit committee conditions were selected to represent audit commit- 
tees that can, and do, exist in today's regulatory environment. For example, our weak committee meets the SOX 
requirement of independence (no members are currently affiliated with the company) and recommendation for expertise 
(one member qualifies as a financial expert). With respect to independence, there are numerous publicly traded compa- 
nies with former employees serving on their audit committees (e.g., FedEx, Goldman Sachs, Kohl's, and Sunoco). 
Similarly, many firms have a sole designated financial expert (Carcello et al. 2006], including H.J. Heinz Co., CSX 
Corp., Texas Instruments, Inc., and Nike, Inc. Further, as there are no requirements for frequency of meetings, there is 
much between-firm variation. Consistent with our weak audit committee manipulation, a number of firms' audit com- 
mittees meet infrequently (1.е., four or fewer meetings; see Huron 2006). 
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estimated economic life of the leased property (the more precise standard based on SFAS No. 13). 
Another group was instructed that a lease must be capitalized if the lease term is “for the major 
part” of the estimated economic life of the leased property (the less precise standard based on IAS 
17). Both groups were told that a lease must be classified as an operating lease if it does not meet 
the capital lease criterion. In addition, participants were provided the following definitions of 
“lease term” and “bargain renewal option” based on definitions in both SFAS No. 13 and IAS 17: 


» Lease term is defined as the fixed non-cancelable term of the lease plus all periods 
covered by bargain renewal options. 

» Bargain renewal options allow the lessee to renew the lease for a rental sufficiently 
lower than the fair rental of the property such that exercise of the option appears, at the 
inception of the lease, to be reasonably assured. 

Given the definition of lease term, lease classification requires financial statement preparers to first 
judge whether any renewal option embedded in the lease is a bargain. Second, preparers must 
judge whether the lease term, i.e., the number of years in the fixed non-cancelable portion of the 
lease plus the number of years in any bargain renewal option, meets the threshold established in 
the standard. 

Section П asked participants to assume the role of controller of a hypothetical company that 
has entered into a lease for new equipment with an estimated economic life of ten years. The lease 
has a non-cancelable lease period of seven years. At the end of the initial non-cancelable lease 
period, the agreement provides the company the option to renew the lease for an additional year, 
with the monthly rental payment set at a rate that allows for some discretion in judging whether 
the renewal option represents a bargain.!! Participants applying the “equal to 75% or more” 
criterion must exercise their judgment in measuring the “lease term” by determining whether the 
lease renewal option represents a bargain. They can justify an operating lease classification by 
arguing that the renewal option does not represent a bargain and therefore the lease term (seven 
years) is only 70 percent of the asset’s economic life (ten years). However, if they judge the 
renewal option to be a bargain, then they should add the bargain renewal option period (one year) 
to the initial lease period (seven years) and view the lease term as eight years, which is 80 percent 
of the asset's economic useful life. In this case, the bright-line threshold in the standard should 
cause them to classify the lease as a capital lease. Participants applying the less precise standard 
must exercise similar judgment in determining whether the lease renewal option represents a 
bargain. However, their interpretation of "for the major part" also factors into their lease classifi- 
cation decisions. 

Participants were provided a summary of the differential impact of the two accounting treat- 
ments on the company's financial statements and key ratios at the end of the first year of the lease. 
The summary shows that capitalization of the lease generally produces less favorable financial 
results, including lower EPS and return on assets figures and a larger debt-to-equity ratio. Thus, 
participants generally should have an incentive to classify the lease as an operating lease. To make 
this incentive more salient, participants were told that the company is publicly traded and that it 


И Controllers at several Fortune 500 companies suggested that setting the renewal option at approximately 90 percent of 
the fair rental value would be sufficiently ambiguous to require judgment as to whether it represents a bargain (і.е., 
whether to include it as part of the total lease term). Thus, we set the option at 90 percent to provide an opportunity for 
the financial incentives of alternative classification to factor into participants’ lease classification decisions. 

12 That is, for the renewal option to place a participent applying the less precise standard in a similar professional decision 
context as those applying the more precise standard, a less precise standard participant must interpret "for the major 
part" as 271 and S80 percent of economic life. We, therefore, conduct an analysis comparing those participants in the 
71 to 80 percent range to those in the more precise group (1.6., for both groups of participants, the judgment regarding 
the option results in a different classiiication). We discuss this in greater detail below. 
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intends to raise capital in the second year of the lease term through the issuance of both common 
stock and the sale of bonds. After considering the case information, participants recorded their 
lease classification decisions and responded to several post-experimental and demographic ques- 
tions (Sections Ш and IV of the instrument). 


Experiment 1 Results 


Our hypotheses are tested using a 2 X 2 ANOVA (standard precision by audit committee 
strength) with the financial statement preparer’s lease classification decision serving as the depen- 
dent магіабів. З Due to the directional nature of expectations, all tests of hypotheses are one-tailed. 
Cell means and ANOVA results are presented in Table 1.“ 


Standard Precision 

НІ predicts that financial statement preparers applying a precise standard will be more likely 
to make an aggressive financial reporting decision (1.е., classify the lease as an operating lease) 
than preparers applying an imprecise standard. Preparers recorded their lease classification 
decisions on a ten-point, forced-choice scale (1 = “Definitely classify as an operating lease” and 
10 = “Definitely classify as a capital lease"). Table 1 reports results consistent with expectations. 
Specifically, preparers are more likely to report aggressively when presented with a more precise 
standard (mean = 4.98) than with a less precise standard (mean = 7.83, F = 23.36, p < 0.001). 

It is important to establish that these findings are not solely a function of our manipulation of 
precision resulting in essentially different decision contexts for the less and more precise condi- 
tions. That is, for participants in the less precise standard condition, the term “for the major part” 
is subject to interpretation. For a more meaningful comparison, participants in the less precise 
condition must be placed in a similar decision context to those in the more precise condition. 
Specifically, participants’ interpretations of “for the major part” must place them in a situation in 
which their professional judgment regarding the renewal option is meaningful (i.e., the judgment 
regarding the option results in a different lease classification decision). У Of the 47 participants in 
the less precise group, 19 indicated that they interpreted the phrase within this professional judg- 
ment range (16., as 271 percent and $80 percent). As a more restrictive test of the effect of 


3 As a manipulation check for audit committee strength, participants were asked to assess (on a seven-point scale where 
1 = Low and 7 = High) the company's audit committee along three dimensions: independence from management, 
accounting/financial expertise, and frequency of meetings. t-tests indicate that participants in the strong audit committee 
condition assessed the committee's independence (mean = 6.29), accounting/financial expertise (mean = 6.41), and 
frequency of meetings (mean — 6.63) to be significantly higher (all p's « 0.001) than participants in the weak condition 
(means of 2.93, 2.52, and 2.84 for independence, expertise, and frequency of meetings, respectively), suggesting that our 
audit committee manipulation was successful With respect to the standard precision manipulation, 85 percent of 
participants correctly identified the applicable standard. Our conclusions remain the same if participants who failed this 
manipulation check are excluded from the analysis. 

Levene's test for equality of variances indicates violations of the ANOVA assumption of homogeneity of variances for 
the lease decision (discussed later). While ANOVA results are typically quite robust to violations of this assumption 
(Box 1954; Lindman 1974), we also conduct analyses on transformations (1.е., square root, natural log, inverse function, 
and ranking transformations) of our dependent variable for assurance. Results of these analyses are consistent with the 
results presented in Table 1. 

That is, if a participant in the less precise condition interprets "for the major part" as S70 percent, then the fixed term 
of the lease alone would automatically result in capitalization. If the interpretation is > 80 percent, then the lease would 
be treated as an operating lease regardless of whether the renewal option is determined to be a bargain. In either case, 
the participant's professional judgment regarding the renewal option would be irrelevant to the classification of the 
lease. Thus, we focus this analysis on those participants whose responses were within the range that requires a “pro- 
fessional judgment" similar to that required of participants in the more precise standard condition. 

Mean and median responses for the 47 participants in the less precise group are 69.13 and 70 percent, respectively, with 
27 participants interpreting "major part" as S70 percent, 19 as within the 71 to 80 percent "judgment required" range, 
and one as > 80 percent. 
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TABLE 1 
Experiment 1: Lease Classification Decisions (Likelihood Scale) 
Panel A: Mean (Standard Deviation)* 


Audit Committee 
Standard Precision Weak . Strong | , Overal | 
Less precise 8.14 7.58 7.83 
(2.24) (2.75) (2.52) 
n= 21 п = 26 п = 47 
More precise 4.13 573 4.98 
(3.08) (3.51) (3.38) 
n = 23 n = 26 п = 49 
Overall 6.05 6.65 
(3.36) (3.25) 
n= 44 п = 52 
Panel B: ANOVA Results 
p-value” 
Model 8.76 <0.001 
Experimental Variables 
Standard Precision 23.36 «0.001 
Audit Committee 0.73 0.396 
Interaction 
Standard Precision X Audit Committee 3.19 0.039 


* Participants indicated the likelihood they would classify a lease as either an operating lease or a capital lease using а 
ten-point scale numbered from 1 ("Definitely classify as an operating lease") to 10 ("Definitely classify as a capital 
lease"). 

> Where expectations are directional, p-values are based on one-tailed tests. 





standard precision, we retest НІ using only these 19 participants (non-tabulated mean = 7.00 for 
these individuals). A comparison with responses from the more precise group (mean = 4.98) 
provides additional support for H1 (р = 0.013).!” 

To further explore the effect of standard precision, we dichotomize preparers' responses at the 
midpoint as a measure of the practical significance of their lease classification decisions. Analysis 
with the dichotomized variable reveals that participants in the more precise standard condition are 
significantly more likely to take the aggressive position of classifying the lease as an operating 
lease than those applying a less precise standard. Sixty-one percent of preparers in the more 
precise condition classified it as an operating lease versus 15 percent in the less precise condition 
(2 2 21.74, p < 0.001; Table 2), providing further support for НІ. 

Additionally, in accordance with the development of H1, participents appear to interpret the 
facts of the lease scenario in a manner consistent with their classification decisions. We find a 
significant correlation between their classification decisions and their interpretation of whether the 


17 Results are similar when the data are parsed to include only those in the less precise condition who interpret “for the 
major part” as 275 percent (i.e., participants whose interpretations are as high as, or higher than, the threshold in the 
more precise standard). 
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TABLE 2 
Experiment 1 
Contingency Analysis of Lease Classification Decision by Standard Precision Condition 
Lease Classification Decision 

Standard Precision Operating Lease Capital Lease Row Total 
Less precise 7 40 47 

Row % 14.89% 85.11% 1€0.00% 
More precise 30 19 49 

Row % 61.22% 38.78% 100.0096 
Column total 37 59 


Х = 21.74 (р < 0.001) 


Participants’ ten-point likelihood responses were dichotomized at the midpoint such that responses of 1 to 5 on the scale 
are classified as Operating Lease and responses of 6 to 10 are classified as Capital Lease. 





option to renew the lease at the end of the lease term represents a bargain (Pearson r = 0.580; 
р < 0.001). That is, participants who indicate they are more likely to capitalize the lease tend to 
believe more strongly that the renewal option represents a bargain. 


The Influence of the Audit Committee 

H2 predicts that audit committee strength will have a greater effect on aggressive reporting in 
a rules-based regime (where a strong audit committee could inhibit financial statement preparers’ 
opportunistic application of a standard) than in a more principles-based regime (where greater 
concern about regulator sanctions could lessen the burden on the audit committee to constrain 
aggressive reporting). Table 1 reports a significant and directionally consistent interacticn between 
standard precision and audit committee strength (F = 3.19, p = 0.039), providing support for H2. 
Further, non-tabulated comparisons within each of the standard precision conditions reveal that 
audit committee strength influences lease classification decisions under the rules-based regime 
(means = 5.73 and 4.13 for the strong and weak audit committee conditions, respectively, 
p = 0.049), but not under the principles-based regime (means = 7.58 and 8.14 for the strong and 
weak audit committee conditions, respectively, p = 0.441). 

An interesting finding emerges when we compare the mean responses in Table 1 in the more 
precise standard/strong audit committee condition (mean = 5.73) with those in the two less 
precise standard conditions (means = 7.58 and 8.14 for the less precise/strong and less precise/ 
weak conditions, with two-tailed p-values of 0.040 and 0.007, respectively). We find that both of 
the less precise standard conditions lead to less aggressive reporting, regardless of audit committee 
strength. While it is important to stress the context-specific nature of this result, the shift to a more 
principles-based standard appears to have a greater dampening effect on aggressive reporting than 
increasing the strength of the audit committee does in our setting. 


Comparability 

A concern with moving from more precise (rules-based) to less precise (principles-based) 
standards is that it could result in reduced financial statement comparability across firms (FASB 
2002; Nelson 2003; Schipper 2003; SEC 2003). If this concern has merit, one would expect the 
variability of lease classification decisions to be greater with a less precise standard than with a 
more precise standard. Contrary to concerns, Table 1, Panel A reveals that there is less variability 
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(as measured by the standard deviation) in the decisions made by preparers in the less precise 
standard group than in the more precise group (SD = 2.52 and 3.38, respectively). Levene’s test 
for equality of variances indicates a significant difference between the two groups (F = 15.397, 
p < 0.001, two-tailed, non-tabulated). This suggests that application of more principles-based 
standards need not result in less inter-firm comparability than more precise standards. 


IV. EXPERIMENT 2 
Concern for Sanctions and Economic Substance: A Three-Path Mediating Model 


In experiment 1, we find a significant relationship between standard precision and financial 
statement preparers’ lease classification decisions. Specifically, a less precise (more principles- 
based) standard results in less aggressive financial reporting (H1). The development of Н1 sug- 
gests two potential mediators of this relationship. With their bright-line thresholds, rules-based 
standards can invite transaction structuring and incentive-consistent standard interpretation to 
achieve preferred accounting treatments (FASB 2002; Bockus et al. 2003; Nelson 2003). Thus, as 
Maines (2007) suggests, financial statement preparers could prefer rules-based standards because 
they are believed to reduce second-guessing by regulators as to whether a standard has been 
applied opportunistically. Aggressive reporting could be more difficult with a principles-based 
standard in which a threshold is not explicitly stated, as the risk of being perceived to be out of 
compliance is greater due to the inherent uncertainty of the standard (Nelson et al. 2002). When 
there is uncertainty regarding the proper accounting, preparers might decide not to select their 
desired treatment because of the increased risk of second-guessing by regulators, as well as the 
costs accompanying such sanctions (Calfee and Craswell 1984; Craswell and Calfee 1986). This 
increased concern for second-guessing would result in a greater desire to reflect the underlying 
economics of transactions and events in the financial statements. That is, if preparers can show 
that their reporting decisions fairly represent economic reality, they will be better able to defend 
themselves when second-guessed by regulators. We propose a three-path (two-mediator) model in 
which concern about second-guessing and interest in reporting the economic substance of a trans- 
action sequentially mediate the relationship between standard precision and aggressive reporting. 
Experiment 2 is designed to explore this mediating relationship and tests the following hypothesis: 


H3: Concern about second-guessing by regulators and the desire to report the economic 
substance of the transaction will (sequentially) mediate the relationship between stan- 
dard precision and financial statement preparers’ aggressive financial reporting decisions. 


Task and Procedure 


Similar to experiment 1, participants were 92 experienced financial statement preparers (i.e., 
CFOs, Controllers, VPs of Finance) with an average of 24.9 years of professional work 
experience.’ Experimental materials were similar to those used in experiment 1. Again, we ma- 
nipulate standard precision and audit committee strength and ask participants to make a lease 
classification decision. However, we made minor modifications to the details of the lease agree- 
ment in an effort to rule out the possibility that a ceiling effect is responsible for our experiment 
1 finding that the burden on the audit committee to curb aggressive reporting eases with a move 
toward a more principles-based standard (i.e., mean responses for participants in both principles- 


18 We conducted another mailing to 1,000 financial executives, receiving 99 replies with 88 returned as undeliverable for 
a response rate of 10.9 percent. Seven responses were unusable: five were completed by inappropriately classified 
individuals (our conclusions remain the same with or without these individuals), and two were returned with no response 
on the dependent variable, resulting in 92 usable responses. Demographic variables (e.g., experience-related measures, 
years at current job) were not significantly different between conditions and, when included in our analyses, were neither 
significant nor altered the conclusions we draw. 
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based conditions were relatively close to the high, or “capitalize,” endpoint of the response scale). 
The modifications to the lease agreement were designed to allow for greater freedom for upward 
movement on the scale, such that an effect of audit committee strength would be more easily 
detectible. Specifically, relative to experiment 1, we reduced the fixed non-cancellable term of the 
lease to five years. The option to renew the lease was increased to three years, but at a rental 
payment of 92 percent of the equipment’s fair rental value. The rental payment was increased to 
make it easier for participants to conclude that the renewal option is not a bargain and thus justify 
classifying the lease as an operating lease (i.e., to move participants down the response scale to, 
hopefully, rule out the possibility of a ceiling effect driving our Н1 findings for experiment 1). 
Further, the changes in the length of the fixed term and option increase the proportion of partici- 
pants in the principles-based standard condition for whom the lease classification requires judg- 
ment. These modifications allow for a wider range of participants’ interpretations of “for the major 
part” (1.е., 251 percent and 280 percent) to place them in a situation in which their professional 
judgment regarding the renewal option is meaningful (ie., the judgment regarding the option 
results in a different lease classification decision). 

In an effort to attempt to identify the mechanisms driving our findings in experiment 1 (and 
test H3), we ask about several factors that could influence participants' lease classification deci- 
sions. Participants were asked, post-experimentally in experiment 2, to consider how much their 
decisions were influenced by five potentially influential factors (relative to other potential factors). 
These factors relate to their desire to: (1) "Report the economic substance of the lease in the 
financial statements,” (2) “Avoid possible second-guessing of my decision by the company's audit 
committee," (3) “Avoid possible second-guessing of my decision by the company's auditor,” (4) 
“Avoid possible second-guessing of my decision by external watchdogs such as the Securities and 
Exchange Commission," and (5) "Present the company's financial position and profitability as 
favorably as the circumstances will allow." 


Experiment 2 Results 

The 2 X 2 ANOVA results presented in Table 3 confirm the results of experiment 1. Again, we 
find a significant main effect of standard precision (H1), with preparers applying a more precise 
standard reporting more aggressively (mean — 3.41) than those applying a less precise standard 
(mean = 6.19, p < 0.001).? Dichotomizing participants’ responses, we find that participants in 
the more precise standard condition are significantly more likely to take the aggressive position of 
classifying the lease.as an operating lease (77.3 percent) than those applying a less precise stan- 
dard (33.3 percent; Хв 17.86, р < 0.001, non-tabulated), consistent with НІ. Further, we again 
report significantly less variability in decisions made by preparers in the less precise standard 
group than in the more precise group (SD = 1.48 and 2.20, respectively; Levene Statistic 
= 14.918, p < 0.001). This result provides additional evidence that principles-based standards 
might not reduce inter-firm comparability as some have feared. We again find a significant inter- 
action (p = 0.012) between standard precision and audit committee strength (H2). Note also that 
these results appear to rule out the possibility that a ceiling effect is responsible for our H2 results 
in experiment 1. That is, although the mean responses in the less precise standard conditions were 


9 АН participants in the less precise condition interpreted “for the major part” within the range in which their professional 
judgment regarding the renewal option is required (i.e., 251 percent and 280 percent for experiment 2; overall mean and 
median responses are 68.92 and 70 percent, respectively). Thus, it is unnecessary to conduct the more restrictive test of 
H1 we utilized for experiment 1. Further, results are similar when the data are parsed to include only those in the less 
precise condition who interpret “major part" as 275 percent (Le., as high as, or higher than, the threshold in the more 
precise standard). Eighteen of the 48 participants in the less precise condition fall into this category. 
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TABLE 3 
Experiment 2 
Lease Classification Decisions (Likelihood Scale) 
Panel A: Mean (Standard Deviation) 


Audit Committee 
Standard Precision Weak Strong Overall 
Less precise 6.13 6.24 6.19 
(1.46) (1.54) (1.48) 
п = 23 п = 25 п = 48 
More precise 2.50 4.32 3.41 
(1.79) (2.23) (2.20) 
n= 22 n = 22 п = 44 
Overall 4.36 5.34 
(2.44) (2.11) 
п = 45 п = 47 
Panel В: ANOVA Results 
F p-value” 
Model 22.16 <0.001 
Experimental Variables 
Standard Precision 56.48 <0.001 
Audit Committee 6.81 0.011 
Interaction 
Standard Precision X Audit Committee 3.13 0.012 


* Participants indicated the likelihood they would classify a lease as either an operating lease or a capital lease using a 
ten-point scale numbered from 1 ("Definitely classify as an operating lease") to 10 (“Definitely classify as a capital 
lease"). 

> Where expectations are directional, p-values are based on one-tailed tests. 


a considerable distance away from the “capital lease" endpoint of the scale (overall mean of 6.19 
on a ten-point scale), our finding that the influence of audit committee strength diminishes with a 
move toward a more principles-based standard still holds. 

Table 4 presents means for the regulator second-guessing and economic substance variables 
used to test НЗ, as well as three other potential influencing factors. Participants recorded on 
11-point scales (0 = “Little influence relative to other factors" and 10 = “Very strong influence 
relative to other factors") the extent to which their lease classification decisions were influenced by 
these factors. Consistent with our arguments leading to H3, we find that participants applying a 
less precise standard are more concerned with avoiding regulator second-guessing when making 
their lease classification decision than those applying a more precise standard (means — 6.64 and 
2.14, respectively, p « 0.001). Participants in the less precise standard condition also are more 
concerned with reporting the economic substance of the lease in the financial statements than 
participants in the more precise standard condition (means of 7.17 and 4.30, respectively, 
р < 0.001). 

H3 proposes a three-path mediation model in which concern about second-guessing by regu- 
lators and desire to report the economic substance of a transaction sequentially mediate the rela- 
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TABLE 4 
Experiment 2 
Factors Influencing the Lease Classification Decision 
Less Precise Standard More Precise Standard 
Weak Audit Strong Audit Weak Audit Strong Audit 
Committee Committee Total Committee Committee Total 

Dependent Variable — (п = 23) (n = 25) (n = 48) (n = 22) (n = 22) (n = 44) 
Regulator Second-Guessing 

Mean 6.67 6.60 6.64 1.95 2.32 2.14 

(Std. Dev.) (2.15) (1.94) (2.01) (0.99) (1.17) (1.09) 
Economic Substance 

Mean 7.09 724 7.17 3.17 4.82 4.30 

(Std. Dev.) (2.04) (1.74) (1.87) (1.97) (1.84) (1.96) 
Audit Committee Second-Guessing 

Mean 1.83 3.44 2.67 1.77 6.00 3.89 

(Std. Dev.) (0.94) (1.45) (1.46) (1.02) (1.95) (2.63) 
Auditor Second-Guessing 

Mean 3.65 3.60 3.63 3.68 3.91 3.80 

(Std. Dev.) (1.23) (1.19) (1.20) (1.32) (1.27) (1.29) 
Favorable Financial Presentation А 

Меал 3.50 3,28 3.39 7.32 6.41 6.86 

(Std. Dev.) (2.38) (2.73) (2.56) (2,32) (1.92) (2.15) 


Participants responded to the prompt “Relative to other factors, how much was your lease classification decision influenced 
by your desire to ..." on 11-point scales, where 0 = “Little influence relative to other factors" and 10 = “Very strong 
influence relative to other factors” for each of the following factors: “Avoid possible second-guessing of my decision by 
external watchdogs such as the Securities and Exchange Commission” (Regulator Second-Guessing); “Report the eco- 
nomic substance of the lease in the financial statements” (Economic Substance); “Avoid possible second-guessing of my 
decision by the company’s audit committee” (Audit Committee Second-Guessing); “Avoid possible second-guessing of my 
decision by the company’s auditor" (Auditor Second-Guessing); and “Present the company's financial position and prof- 
itability as favorably as the circumstances will allow” (Favorable Financial Position). 


tionship between standard precision and aggressive reporting. Figure 1 reports the results of our 
test of this model, which support H3. The predicted mediating relationship is demonstrated by the 
following regression results (MacKinnon and Dwyer 1993): 


(1) Standard precision significantly affects concern for regulator second-guessing (path I in 
Figure 1, b, = 4.499, р < 0.001), 
(2) Concern for regulator second-guessing influences desire to present the economic sub- 
stance of the transaction (path П, b; = 0.503, р <0.001), 
(3) Concern for reporting the economic substance of the transaction significantly influences 
aggressive financial reporting (path Ш, b4 = 0.710, р < 0.001), and 
(4) Inclusion of concern for reporting the economic substance of the transaction in the analy- 
sis reduces the effect of standard precision on aggressive reporting (path IV, b4 = 0.740 
and p = 0.030 when concern for economic substance is included in the regression versus 
b, = 2.778 and р < 0.001 when concern for economic substance is not included in the 
regression). 
Further, we find that the mediating effect predicted by H3 is significant (z = 2.35, p = 0.009; see 
MacKinnon and Dwyer 1993). Thus, this analysis identifies two mechanisms that help explain the 
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FIGURE 1 
Experiment 2: Three-Path Mediation Model and Results 
П 
by=.503 
(p<.001) 
ш 

1 b3=.710 
by=4.499 (p<.001) 


(p<.001) 


IV 
b42.740 
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Standard Precision is the treatment variable manipulated at two levels (0 = more precise; 1 = less precise). 
Aggressive Financial Reporting is a participants lease classification decision recorded on a ten-point scale, 
where 1 = “Definitely classify as an operating lease” and 10 = “Definitely classify as a capital lease.” Lower 
responses are indicative of more aggressive reporting decisions. 

Regulator Second-Guessing is a participants response to the post-experimental question “Relative to other 
factors, how much was your lease classification decision influenced by your desire to: Avoid possible second- 
guessing of my decision by external watchdogs such as the Securities and Exchange Commission" (recorded on 
an 11-рош scale where 0 = “Little influence relative to other factors" and 10 = “Very strong influence relative 
to other factors"). 

Economic Substance is a participant's response to the post-experimental question “Relative to other factors, how 
much was your lease classification decision influenced by your desire to: Report the economic substance of the 
lease in the financial statements" (recarded on an 11-point scale where 0 = “Little influence relative to other 
factors" and 10 = “Very strong influence relative to other factors”). 

b, through b, are the estimated coefficients from the following regression equations: 


Regulator Second-Guessing = By, + By (Standard Precision) + в! р 9 91 6 9. 
1 
Economic Substance = Ву + B.(Regulator Second-Guessing) + е, 


Aggressive Financial Reporting = Воз + By(Economic Substance) + B4(Standard Precision) + 83 


To determine the significance of the mediating relationship (paths I through IIT), a z-score is calculated using the 
coefficient values and their standard errors. We calculate the variance of the mediating effect (Зыльзьз) 
using a first-order Taylor series estimate (multivariate delta method) (MacKinnon and Dwyer 1993): 
z= (9:5554) / $1253 за 2.35 (р = 009). 


results we obtained in experiment 1 related to the influence of level of standard precision on 
aggressive financial reporting (H1). 


Other Factors Influencing the Lease Classification Decision 
Table 4 presents three other potential influencing factors: concern for second-guessing by the 
audit committee, concern for second-guessing by the auditor, and desire to present the company's 
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financial position as favorably as the circumstances allow. Consistent with prior archival work in 
rules-based settings, audit committee strength appears to have significant influence over partici- 
pants’ decisions when applying a more rules-based standard (means of more precise standard/ 
strong audit committee and more precise standard/weak audit committee = 6.00 and 1.77, respec- 
tively, p < 0.001). 

It is interesting to note, however, that a stronger audit committee appears relatively less 
influential in our more principles-oriented setting (means of less precise standard/strong audit 
committee condition and more precise standard/strong audit committee condition = 3.44 and 6.00, 
respectively, p < 0.001). This is consistent with our finding that our principles-based standard 
eases the burden on audit committees (whether strong or weak) to curtail management's aggres- 
sive reporting choices.”° That is, when applying our more principles-oriented standard, it appears 
that preparers are concerned primarily about regulator second-guessing. This, in turn, dampens 
pursuit of aggressive financial reporting and results in reporting choices that are more consistent 
with audit committee preferences. 

Further examination of potential influential factors in the two more precise standard cells 
suggests that, when the audit committee is weaker, financial statement preparers are more con- 
cerned with presenting the company's financial position as favorably as the circumstances will 
allow (mean = 7.32) than when the audit committee is stronger (mean = 6.41, p = 0.082), It also 
is interesting to note that presenting a favorable financial position has a much smaller influence on 
the lease classification decision for participants in the less precise standard condition than for 
participants in the more precise standard condition (means of 3.39 and 6.86, respectively, 
р < 0.001). 


У. SUMMARY AND CONCLUSIONS 

As a result of recent corporate accounting scandals, the U.S. Congress passed SOX in an 
effort to reform the U.S. financial reporting system. In turn, this legislation has pushed the SEC 
and the FASB to consider reforms designed to inhibit aggressive financial reporting. These re- 
forms include a shift toward more principles-based accounting standards and enhancing the role of 
the audit committee. We provide the first evidence on how these two SOX-related regulatory 
initiatives jointly impact financial statement preparers’ reporting judgments. Specifically, our study 
reports the results of two experiments investigating the effect of financial reporting standard 
precision on experienced financial statement preparers’ financial reporting judgments, as well as 
the potential role that audit committee strength plays in mitigating aggressive financial reporting. 

Consistent with expectations, our results show that financial statement preparers are less likely 
to report aggressively when applying a less precise financial reporting standard than when apply- 
ing a more precise standard. We also find a significant interactive effect of audit committee 
strength and standard precision on preparers’ lease classification decisions. We find that audit 
committee strength affects aggressive reporting in a rules-based regime by inhibiting financial 
statement preparers’ opportunistic application of a standard. In contrast, audit committee strength 
has no effect in a more principles-based regime where preparers are more concerned about regu- 
lator second-guessing and reporting the economic substance of a transaction regardless of audit 
committee strength. We also find support for a three-path mediating model in which concern about 
second-guessing by regulators and the desire to report the economic substance of a transaction 
sequentially mediate the relationship between standard precision and preparers’ aggressive finan- 
cial reporting. Further, we find significantly less variability among preparers’ financial reporting 


20 In contrast to the audit committee (a manipulated variable). we did not expect, nor did we find, significant differences 
between any of the four cells with respect to concern for second-guessing by the auditor (p = 0.847). 
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decisions when a less precise standard is in place. This suggests that, contrary to the concern of 
some, the application of more principles-based standards need not result in less inter-firm compa- 
rability than more precise standards. 

Our findings have implications for both practice and research. With respect to practice, our 
results provide insight into the principles-versus-rules debate and should be of interest to U.S. 
policy makers as they continue to contemplate a switch from rules-based to principles-based 
accounting standards. Our findings suggest that moving toward more principles-based standards 
(such as the SEC’s proposed adoption of IFRS) will not necessarily open the door to greater 
opportunistic reporting by financial statement preparers; instead, this shift could result in more 
economically meaningful reporting. Also, our results regarding variability in financial statement 
preparer responses can help allay concerns regarding inter-firm comparability in a principles-based 
standards regime. Further, in our setting, we find that switching to a more principles-based ap- 
proach appears to have a greater dampening effect on aggressive reporting than does strengthening 
the audit committee. This result, while context-specific, warrants further inquiry by policy makers 
and researchers. Future research could further explore the relationship between audit committee 
strength and financial reporting quality under more principles-oriented regimes. Finally, prior 
research suggests that audit partners believe principles-based standards will reduce their power in 
resolving auditor-client conflicts (Gibbins et al. 2001). However, our results suggest that the 
number of such disagreements could diminish, which should help alleviate concerns of auditors’ 
lost leverage. 

It is important to note that our experiments focus on a setting in which the rules-based 
standard has been unable to curb aggressive reporting (i.e., the lease capitalization decision), even 
given a strong regulatory environment in which there is general concern over aggressive reporting 
choices. To broaden the generalizability of our findings, future research could explore other set- 
tings involving more restrictive rules that more effectively bind financial statement preparers to a 
conservative treatment. Further research could also explore the effect of differing levels of regu- 
latory scrutiny on principles-based standards. Such research would further our understanding of 
the roles that standard precision and audit committee strength play in mitigating aggressive finan- 
cial reporting practices. 
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ABSTRACT: We provide evidence on the effects of SFAS 133 on the risk relevance of 
accounting measures of bank derivative exposures to bond markets. First, we find that 
Interest rate derivatives classified as hedging are more negatively associated with 
fixed-rate bond spreads after SFAS 133. We also find that hedging derivatives offset 
non-trading positions to a greater extent after SFAS 133. Second, for the largest 25 
banks, we find that Interest and foreign exchange rate trading derivatives are more 
negatively associated with fixed-rate bond spreads after SFAS 133, consistent with 
more economic hedges being classified as trading after SFAS 133. For these banks, 
trading derivative exposures offset поп-депма ће trading exposures to a greater extent 
after SFAS 133. Our results suggest that, contrary to critics’ claims, SFAS 133 has 
increased the risk relevance of accounting measures of derivative exposures to bond 
investors and benefited banks In terms of reducing thelr cost of capital. 
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І. INTRODUCTION 
е provide evidence on the effects of the Statement of Financial Accounting Standards 
W (SFAS) 133, Accounting for Derivative Instruments and Hedging Activities (FASB 
1998), on the risk relevance of accounting measures of bank derivative 
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exposures.’ We focus on banks because they extensively use derivatives in both hedging and 
trading activities. We assess risk relevance by examining the extent to which bank fixed-rate bond 
spreads (hereafter, bond spreads) are associated with measures of exposures from derivatives 
classified as hedging (hereafter, hedging derivatives) and derivatives classified as trading (hereaf- 
ter, trading derivatives). We also provide corroborating evidence on how SFAS 133 affected the 
risk-reducing (or risk-enhancing) role of derivatives by studying the association between changes 
in fair values of derivatives and changes in fair values of non-derivative positions. 

These research questions are important for at least three reasons. First, even though account- 
ing guidance for derivatives prior to SFAS 133 was inconsistent, incomplete, and lacking in 
transparency, there was considerable controversy over the potential benefits and costs of SFAS 
133, especially for banks (see Section II for more discussion of the accounting guidance before 
and after SFAS 133). In particular, proponents argued that, because SFAS 133 provides greater 
uniformity in accounting for derivatives and standardizes the criteria for qualifying for hedge 
accounting, it improves the transparency and reliability of derivatives reporting by banks as well 
as their monitoring of risk-management strategies involving derivatives, thereby reducing their 
cost of capital.” However, critics argued that SFAS 133 makes it more difficult to obtain hedge 
accounting treatment and exacerbates the difference between accounting and economic hedges.” 
Thus, to the extent banks record economic hedges as if they are speculative and investors penalize 
banks for these positions (Morgan and Stiroh 2001), SFAS 133 would increase banks’ cost of 
capital. Similar views were expressed by the Financial Executives Institute (1997, 2), which 
stated, “[SFAS 133] will send a highly misleading message to readers of financial statements, 
unfavorably affecting the cost of capital.” Because one of the precepts the Financial Accounting 
Standards Board (FASB) follows when issuing a standard is that “the costs imposed to meet the 
standard should be justified in relation to the overall benefits of the resulting information” (SFAS 
133, 4 232), documenting evidence on the effects of SFAS 133 is necessary for a complete 
cost-benefit analysis of this standard. 

Second, prior accounting studies focus mostly on the value relevance of bank derivative 
disclosures (Barth et al. 1996; Eccher et al. 1996; Nelson 1996; Venkatachalam 1996; Ahmed et al. 
2006), risk relevance of bank income measures (Beaver et al. 1970; Hodder et al. 2006), and risk 
relevance of disclosures to equity investors (McAnally 1996; Schrand 1997). To our knowledge, 
the risk relevance of accounting measures of derivative exposures to bond investors has not been 
examined previously. This is an important gap in the literature because (1) bonds are a much larger 
source of capital than stocks," and (2) there are important structural differences between bond and 





- 


SFAS 138 (FASB 2000), Accounting for Certain Derivative Instruments and Certain Hedging Activities—An amend- 
ment of SFAS 133, and SFAS 149 (FASB 2003), Amendment of SFAS 133 on Derivative Instruments and Hedging 
Activities, which were issued later, address certain issues related to the implementation of SFAS 133, but do not change 
the basic model of SFAS 133. 

For example, the Investor Protection Department of the Consumer Federation of America (1997, 1) states that: “[ac- 

counting standards for derivatives and hedging activities before SFAS 133] leave investors in the dark about billions of 

dollars of transactions that the companies they invest in engage in each year. [5248 133] will aid investors by increasing 
the visibility, comparability, and understandability of the risks and benefits associated with these often complex financial 
instruments.” 

3 For example, Canadian Bankers Association (1997, 2) states that “The hedging criteria in [SFAS 133] are overly 
restrictive and disallow hedge accounting in situations where economic exposures are currently being effectively 
hedged.” 

4 The responses to the Exposure Draft of SFAS 133 indicate that banks generally opposed SFAS 133 from a cost-benefit 

standpoint. For example, First Chicago NBD Corporation (1997, 13-14) states: “The standard cannot be justified from 

a cost-benefit standpoint ... [Proposed changes] are not critical enough to compensate for the cost of implementing the 

standard.” 

Flow of Funds data published by the Board of Governors of the Federal Reserve System indicate that the size of the debt 

market at the end of 2005 was about twice that of the equity market. 
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stock markets. For example, bond markets are dominated by more sophisticated participants than 
are stock markets (Bhojraj and Swaminathan 2009), but are less liquid than stock markets. This 
study jointly investigates the risk relevance of accounting measures of derivative exposures in 
bond markets and how SFAS 133 altered that relevance. 

Third, studying the effects of SFAS 133 on bond spreads is a more direct and cleaner way of 
studying the effects of SFAS 133 on the risk relevance of derivatives reporting because the 
primary driver of bond prices is risk, whereas stock prices are driven by changes in expected 
future cash flows as well as changes in risk (or discount rates). 

If derivatives are used for risk reduction, then derivative exposures would be negatively 
associated with bond spreads. We hypothesize that derivatives classified as hedging are more 
negatively associated with bond spreads after SFAS 133 because SFAS 133 uses more stringent 
criteria for classifying derivatives as hedging derivatives and requires more detailed documenta- 
tion of the hedging relationship than prior standards, thus increasing the likelihood that hedging 
derivatives consist of highly effective derivative hedges. On the other hand, trading derivatives 
will be positively (negatively) associated with bond spreads if they increase (reduce) net expo- 
sures. However, because the relative weight of derivatives used for economic hedges within the 
trading derivatives category is likely to increase after SFAS 133, we also hypothesize that trading 
derivatives will be less positively (or more negatively) associated with bond spreads after SFAS 
133 if bondholders correctly perceive the increase in the relative weight of economic hedges 
within the trading derivatives category. We expand on these arguments in Section П. 

We examine the association between bank bond spreads and accounting measures of deriva- 
tive exposures before and after SFAS 133 while controlling for bond and bank characteristics 
identified in prior studies. Our tests are based on a sample of subordinated bank bonds with U.S. 
dollar denominations. We find that, after SFAS 133, exposures from interest rate hedging deriva- 
tives are more negatively associated with bond spreads. In terms of economic magnitudes, before 
SFAS 133, the tax-adjusted bond spread is approximately 19.5 basis points lower for a bank with 
the mean level of exposure from hedging derivatives relative to a bank with no such exposure, 
whereas it is 30.8 basis points lower after the adoption of SFAS 133. Furthermore, we find that 
exposures from interest and foreign exchange rate trading derivatives are also more negatively 
associated with bond spreads after SFAS 133 for the largest banks, which are extensively involved 
in derivatives trading. For these banks, the mean level of exposure from trading derivatives is 
associated with a 5.8 basis point lower spread before, and a 9.9 basis point lower spread after 
SFAS 133, compared to banks without such exposure. 

To provide corroborating evidence on the risk-reducing role of derivatives, we investigate 
how SFAS 133 altered the relation between fair values of derivative and non-derivative exposures. 
Specifically, we hypothesize that, after SFAS 133, (1) hedging derivatives offset exposures from 
non-trading positions to a greater extent, and (2) trading derivatives offset exposures from non- 
derivative trading positions to a greater extent. Consistent with these hypotheses, we find that fair 
value changes in hedging derivatives and trading derivatives are more negatively related to fair 
value changes in non-trading positions and non-derivative trading positions, respectively, after 
SFAS 133. These results suggest that hedging derivatives offset exposures from non-trading po- 
sitions, whereas trading derivatives offset exposures from non-derivative trading positions. 

We contribute to the prior literature in two important ways. First, SFAS 133 has been one of 
the most controversial standards ever issued by the FASB. Contrary to critics’ claims, we find that 
accounting measures of derivative exposures are more negatively associated with bond spreads 
after SFAS 133 than before SFAS 133, indicating that SFAS 133 has improved the risk relevance 
of derivatives reporting, heightened investors’ confidence in the stated purposes of holding deriva- 
tives, and benefited banks in terms of reducing their cost of capital. 
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Second, prior work does not provide evidence on how bond investors price derivative expo- 
sures. We document that bond investors price both hedging and trading derivatives in a manner 
consistent with their use in offsetting non-derivative inherent risk exposures, and that these off- 
setting effects are more pronounced after SFAS 133. Our findings contrast with the results of prior 
studies (such as Barth et al. 1996; Eccher et al. 1996; Nelson 1996; Ahmed et al. 2006) that find 
that equity investors do not price derivative disclosures prior to SFAS 133, but are consistent with 
the greater sophistication of bond market investors. 

Section II presents the background and hypotheses development. Section ПІ describes the 
research design. Section IV presents the evidence and Section V the conclusion. 


П. BACKGROUND AND HYPOTHESES DEVELOPMENT 
Hedge Accounting for Derivatives before and after SFAS 133 


Prior to SFAS 133, accounting for derivatives was primarily guided by SFAS 52 (FASB 
1981), Foreign Currency Translation, and SFAS 80 (FASB 1984), Accounting for Futures Con- 
tracts, which addressed foreign currency transactions and exchange-traded futures contracts, re- 
spectively, and the Emerging Issues Task Force (EITF) Issue No. 84-36. There were several 
important inconsistencies in the requirements for hedge accounting treatment between SFAS 52 
and SFAS 80.° First, SFAS 52 required transaction-level risk reduction for derivatives to be 
classified as hedging, whereas SFAS 80 required enterprise-level risk reduction. Second, while 
SFAS 52 did not refer to correlation between changes in the values of hedging derivatives and the 
hedged items, SFAS 80 frequently emphasized a high correlation requirement for an effective 
hedge (e.g., 4 4 (b) and 11). Consequently, the Securities and Exchange Commission (SEC), under 
SFAS 80, interpreted a hedge to be effective if the cumulative change in the value of the hedging 
instrument was between 80 and 125 percent of the cumulative change in the value of the hedged 
item, but did not articulate a specific threshold for assessing hedge effectiveness under SFAS 52 or 
EITF 84-36 (Swad 1995; Carpenter 1996; Anson 1999; Finnerty and Grant 2002). Third, SFAS 52 
did not require ongoing assessment of hedge effectiveness, whereas SFAS 80 did. Finally, SFAS 
52 did not allow hedging of anticipated transactions, whereas SFAS 80 allowed such hedging. 

In addition to these inconsistencies, there were also gaps in the accounting guidance. Neither 
SFAS 52 nor SFAS 80 addressed widely used derivatives such as interest rate swaps and options 
(Beier 1993; Montesi and Lucas 1996). Hedge accounting for swaps and options was applied by 
analogy to SFAS 52 and SFAS 80 (Gastineau et al. 2001). However, depending upon the sources 
of the analogy, the classification of and accounting for derivatives were ad hoc and often incon- 
sistent across reporting entities (SFAS 133, 4 237). Some derivatives were accounted for as hedges 
regardless of whether they were part of a hedging strategy (SFAS 133, 4 235) or whether they 
were effective hedges. 

The EITF attempted to address the accounting for interest rate swaps with EITF 84-36. 
However, in contrast to the guidance in SFAS 80, enterprise-level interest rate risk reduction was 
not required by EITF 84-36. Hence, prior to SFAS 133, the inconsistencies between the effective 
standards regarding what constituted a hedge, coupled with tbe failure of those standards to cover 
widely used derivatives contracts, made disclosure of and accounting for derivatives among the 
least standardized areas of financial reporting. 

SFAS 133 was issued to provide uniformity in accounting for derivatives and standardize 
hedge accounting criteria. Regarding the former, the standard requires the recognition of all 





5 Rane (1992), Montesi and Lucas (1996), Anson (1999), Gastineau et al. (2001) and the FASB (in SEAS 133) present 
detailed discussions of the inconsistencies between SFAS 52 and SFAS 80. 
7 Jn order to increase transparency in derivatives activities, Ше FASB adopted SFAS 119 (FASB 1994), Disclosure about 
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derivatives at fair value and recognition of changes in the fair value of derivatives either in net 
income or in other comprehensive income. As for the latter, SFAS 133 standardized the criteria to 
qualify for hedge accounting and made what constitutes a hedge more precise. Most relevant to 
banks, under SFAS 133, derivatives that hedge certain types of risks (i.e., the risk of changes in the 
value or cash flows: from specific exposures (i.e., individual assets or liabilities) are classified as 
hedging and qualify for hedge accounting.® In addition, SFAS 133 requires formal and unambigu- 
ous documentation ої the hedging relationship between the derivative and the specific exposure.” 
Given that documentation of the relation between the hedged and the hedging exposures at the 
enterprise level is d-fficult, a shift in focus from enterprise-level risk reduction to specific-exposure 
risk reduction made the relation between the hedging and the hedged exposures easier to document 
and therefore more identifiable after SFAS 133. 

While SFAS 133 does not provide a specific hedge effectiveness threshold, similar to the 
SEC’s effectiveness interpretation under SFAS 80, practitioners interpret high effectiveness to 
mean that cumulative changes in the value of the hedging instrument should be between 80 and 
125 percent of the nverse cumulative changes in the value or the cash flows of the hedged item. 
Accordingly, most macro hedging derivatives do not qualify for hedge accounting under SFAS 
133 as there is no objective method of gauging the effectiveness of the hedging derivative without 
linkage to a single, identifiable asset or liability (SFAS 133, 4 447). 


Hypothesis Develcpment 

Corporate risk management theory posits that hedging reduces costs associated with external 
financing (Mayers end Smith 1982; Smith and Stulz 1985; Dadalt et al. 2002). Banks hold hedging 
derivatives to offset exposures mainly to macroeconomic risk factors, particularly interest and 
foreign exchange rates. Therefore, such derivatives reduce a bank’s systematic risk by making its 
cash flows less dependent on macroeconomic factors (Froot et al. 1993; Gay and Nam 1998), and 
reduce a bank's default risk by shielding its capital from losses due to unanticipated movements in 
market risk factors. Given the finding of Elton et al. (2001) that bond spreads consist mainly of 
systematic and defeult risk premia after tax premium is removed, derivative hedges likely result in 
lower bond spreads to the extent they are effective. 

As discussed earlier, prior to SFAS 133, many derivatives were accounted for under EITF 
84-36 or by analog; to SFAS 52, both of which conflicted with SFAS 80 in terms of risk reduction 
and did not contaim an explicit hedge effectiveness criterion. Derivatives classified as “hedging” 
after SFAS 133 likely meet a higher hedge effectiveness threshold “оп average" than do deriva- 
tives classified as Fedging before SFAS 133 because SFAS 133 (1) links hedging derivatives with 
specific identifiable exposures, (2) standardizes the hedge effectiveness criteria, and (3) requires 
explicit documentetion and ongoing assessment of the hedging relationship. The more stringent 
qualification and cocumentation rules imposed by SFAS 133 are likely to increase investors’ 
confidence in the ztated purpose of holding hedging derivatives and improve the monitoring of 
hedging activities, resulting in less uncertainty in the financial statements and lower bank bond 
spreads. This leads to our first hypothesis: 





Derivative Financiar Instruments and Fair Value of Financial Instruments. SFAS 119, which was superseded by SFAS 

133, regulated only tae derivative-related disclosures, but neither changed the existing accounting for derivatives nor the 

criteria for qualifyinz as a hedging derivative. Instead, it relied on the classification criteria provided by SFAS 52 and 

SFAS 80. 

SFAS 133 also allows hedging of off-balance sheet firm commitments and forecasted transactions. 

? Documentation under SFAS 133 should include identification of (1) the hedging derivative, (2) the hedged asset or 
liability, (3) the nature of the risk being hedged, and (4) how the effectiveness of the hedge is assessed. 
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Н1: Bank bond spreads are more negatively related to derivatives classified as hedging after 
SFAS 133 than before SFAS 133. 


Bank trading derivatives consist of customer-related positions, proprietary positions, and eco- 
nomic hedges. Banks typically take offsetting positions in customer-related derivatives trading. By 
matching customer-related derivatives in terms of market risk exposure, a bank can generate fee 
income without assuming any market risk. Although not as common, some banks also take non- 
offset positions in customer-related derivatives trading (Puwalski 2003). Hence, whether 
customer-related derivatives increase or reduce overall market risk depends on whether the re- 
sidual market risk, if any, from these positions offsets or adds to other risk exposures. Unlike 
customer-related derivatives, proprietary trading derivatives are likely to increase exposure be- 
cause they are held for speculation. Lastly, some derivatives classified as trading are in fact 
economic hedges of other exposures, ^ Therefore, bank derivatives classified as trading include 
risk-reducing positions (1.е., economic hedges), risk-enhancing positions (1.е., speculative posi- 
tions), and positions with ambiguous risk implications (i.e., customer-related positions). However, 
the relative weights of these positions are not disclosed under SFAS 119 (FASB 1994) or under 
SFAS 133. Thus, the relation between bond spreads and trading derivatives could be negative or 
positive, depending upon whether the net trading derivatives positions reduce or increase overall 
risk. However, the relative importance of economic hedges within trading derivatives is likely to 
increase after SFAS 133 for at least two reasons. First, as discussed earlier, SFAS 133 prohibits the 
reporting of derivatives used in macro hedging as hedging derivatives, and requires them to be 
classified as trading derivatives. Second, some derivatives used by banks in de facto hedging 
transactions are classified as trading because they are no longer defined as hedging derivatives, or 
are not sufficiently effective under SFAS 133. Such derivatives include hedges of interest rate risk 
in held-to-maturity securities (SFAS 133, { 21 (d) and 4 29 (e)), long-term core deposits (SFAS 
133, $ 317 and 4 437), and prepayment risks of financial assets or liabilities (SFAS 133, 4 29 (Б)). 

Banks commonly macro-hedge gap positions or value-at-risk. Therefore, they opposed the 
disqualification of macro hedges from hedge accounting in their response letters to the Exposure 
Draft for SFAS 133. The general view of the banking industry was that reporting some legitimate 
economic hedges as trading, as required by SFAS 133, would generate potentially misleading and 
unfavorable results when applied to banks (America’s Community Bankers 1997). While these 
economic hedges are typically less effective hedges than the derivatives that qualify for hedge 
accounting under SFAS 133, they are more effective hedges than derivatives classified as trading 
prior to SFAS 133. Therefore, if creditors price trading derivatives in a manner consistent with 
these derivatives including more economic hedges after SFAS 133, a more negative (or less 
positive) association between bank bond spreads and trading derivatives will result. Thus, we test 
the following hypothesis: . 


H2: Bank bond spreads are more negatively (or less positively) related to derivatives classi- 
fied as trading after SFAS 133 than before SFAS 133. 


Derivatives that hedge exposures from trading positions are required to be classified as trading 
(SFAS 119, 4 47), whereas derivatives that hedge exposures from non-trading assets and liabilities, 
such as loans and deposits, are to be classified as hedging. One of the presumptions underlying H1 
is that derivatives classified as hedging after SFAS 133 are, on average, more effective hedges than 





10 Бог example, in its 10-K for 1999, Compass Bancshares, Inc. states: "The primary purposes for using interest rate 
derivatives in the trading account are co facilitate customer transactions and to help protect cash market positions in the 
trading account against interest rate movement. [Some options in trading account] assist in protecting the fair value of 
the trading account against rising short-term interest rates while maintaining limited risk to declining rates.” (emphasis 
added) 
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derivatives classified as hedging before SFAS 133 because SFAS 133 restricts derivatives classi- 
fied as hedging to highly effective derivative hedges of specific exposures. Therefore, we expect 
derivatives classified as hedging to offset the corresponding hedged items to a greater extent after 
SFAS 133, which leads to the following hypothesis: 


H3: The relation between change in fair value of derivatives classified as hedging and change 
in fair value of net non-trading assets is more negative after SFAS 133 than before SFAS 
133. 


Unlike fair values of hedging derivatives, fair values of trading derivatives need not covary 
with other existing exposures in any specific direction to be classified as trading under either SFAS 
119 or SFAS 133. Therefore, whether trading derivatives offset or add to the exposures from 
non-derivative trading or non-trading positions is an empirical question. Liu et al. (2004) do not 
find significant cross-hedging between trading and non-trading exposures. On the other hand, 
trading derivatives include derivatives that are economic hedges of other trading positions. SFAS 
133 likely increases the average weight of such economic hedges in bank derivative portfolios 
classified as trading for two reasons. First, as mentioned earlier, some derivatives that are classi- 
fied as hedging prior to SFAS 133 are classified as trading after SFAS 133. Because recognizing 
the hedging derivative at fair value, but the hedged position at cost would create income volatility, 
SFAS 133 allows banks to transfer such hedged items into the trading category along with deriva- 
tives hedging these positions (SFAS 133, 4 54). This likely increases the weight of derivative and 
non-derivative positions with offsetting exposures in the trading portfolio.!! Second, banks could 
choose to avoid designating some derivatives as hedging and instead classify them as trading after 
SFAS 133 because the costs associated with the more onerous documentation and more complex 
hedge accounting requirements under SEAS 133 may outweigh the benefits.’* This reasoning leads 
to the following hypothesis: 


H4: The relation between change in fair value of derivatives classified as trading and change 
in fair value of net non-derivative trading assets is more negative (or less positive) after 
SFAS 133 than before SFAS 133. 


IIL RESEARCH DESIGN AND EMPIRICAL MODELS 
Model for Testing Hypotheses 1 and 2 


Hypotheses 1 and 2 posit that the relation between accounting measures of derivative expo- 
sures and bond spreads will differ before and after SFAS 133. We test these hypotheses with a 
model that relates accounting measures of derivative exposures to bond spreads, controlling for 
bond characteristics (Hancock and Kwast 2001), bank characteristics (Flannery and Sorescu 
1996; Morgan and Stiroh 2001; Hancock and Kwast 2001; Jagtiani et al. 2002), and off-balance 
sheet instruments other than derivatives that are potentially used as economic hedges. We estimate 
the model separately for the periods before and after SFAS 133. The model is specified as follows: 


Tu= Bo + BiBig? + BolrHdg), + ВзВіе? + Нави + ByDeltalrHdg,, + BsBig? + DeltalrHdg,, 
+ ВеЕхНав + B;Big? * FxHdgy + BgDeltaFxHdg;,+ ByBigi, ж DeltaFxHdgy, 
+ BiolrTrdy + By Big? ж IrTrd; + By,DeltalrTrd, + В3Вів ж DeltalrTrd;,+ Bi4FxTrd;, 


Ч. Reclassification of some securities is allowed under SFAS 133 to mitigate the likely effect of the standard on earnings 
volatility. Recognizing both the derivatives and the hedged items at fair value after SFAS 133 allows the offsetting gains 
and losses from the hedging and the hedged positions to be recognized in income concurrently. 

12 [n line with this view, a survey by Bank of America (2006) revealed that a small but growing number of banks opted not 
to apply hedge accounting to certain derivative hedges because the cost and effort of effectiveness testing and docu- 
mentation required under SFAS 133 are too high. 

13 We restrict the sample to subordinated bank bonds with U.S. dollar denominations that are all non-callable, non-putable, 
non-sinking fund, and non-convertible. Therefore, bonds in our sample vary in maturity and amount only. 
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+ BisBig? ж FxT rd,  BygDeltaFxTrd,, + By,Big2! * DeltaFxTrd,,+ BygMaturity;, 

+ ВууАтоит, + BogSizey + B5 Insured; + ВоЕзиие + Во МР; + B4 Gapj, 

+ BosNonDerTrd,, + Вэ Сар, Вэ? Коад + BogCrRisk;,+ Воді юалСотт,, 

+ PyoGuarantee;, + Вау Стрету + Вазутру + Вајт! + ёр, (1) 
where for bank i at time t: 


гү = tax-adjusted spread; 
Big; = 1 if the bank is one of the largest 25 banks in the U.S. with respect to asset 
size, and 0 otherwise; 
IrHdgj, = absolute value of net long notional amount of interest rate derivatives classified 
as hedging scaled by total assets; 
ЕхНару = absolute value of net long notional amount of foreign exchange derivatives 
classified as hedging scaled by total assets; 

DeltalrHdg,, = absolute value of the annual change in the fair value of interest rate derivatives 
classified as hedging divided by the annual change in the interest rates scaled 
by total assets; 

DeltaFxHdg,,= absolute value of the annual change in the fair value of foreign exchange 
derivatives classified as hedging divided by the annual change in the foreign 
exchange rates scaled by total assets; 

IrTrd,, = absolute value of net long notional amount of interest rate derivatives classified 
as trading scaled by total assets; 

FxTrd,, = absolute value of net long notional amount of foreign exchange derivatives 
classified as trading scaled by total assets; 

DeltalrTrd,, = absolute value of the annual change in the fair value of interest rate derivatives 
classified as trading divided by the annual change in the interest rates scaled by 
total assets; 

DeltaFxTrd, = absolute value of the annual change in the fair value of foreign exchange 
derivatives classified as trading divided by the annual change in the foreign 
exchange rates scaled by total assets; 

Maturity, = natural logarithm of the bond's years to maturity; 
Атоит = natural logarithm of the size of bond issue in million dollars; 
Size; = natural logarithm of bank's total assets; 
Insured,, = ratio of total insured deposits to total deposits; 
Estate; = real estate acquired in satisfaction of debt previously contracted scaled by total 
assets; 
Ар, = sum of nonaccrual loans and accruing loans past due 90 days ог more scaled by 
total assets; 
Gap;, = absolute value of the difference between bank assets and liabilities maturing or 
repricing within one year scaled by total assets; 
Roa; = annual net income scaled by total assets; 
Cap; = ratio of book value of equity to risk-weighted assets; 

NonDerTrd;, = fair value of non-derivative trading assets scaled by total assets; 

CrRisk;, = bank's credit losses incurred on off-balance sheet derivative contracts scaled by 


total assets; 

LoanComm,, = amount of loans for which official promises to lend are conveyed scaled by 
total assets; 
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Guarantee; = on- and off-balance sheet guarantees and insurance contracts that back financial 
obligations scaled by total assets; 
СтРету = notional amount of credit derivatives on which the bank is the beneficiary 
scaled by total assets; 
Strip; = fair value of principal-only strips scaled by total assets; and 
Іті, = Inverse Mills ratio obtained from the first-stage multinomial logit model. 


We provide the list of data sources and discuss the rationale for including each of the control 
variables in Appendix A. 

Interest rate swaps and options constitute the majority of bank derivative portfolios. Ryan 
(2007) indicates that nonlinearity of option payoff structure makes the assessment of options' 
effectiveness as hedging instruments difficult when the hedged item has a linear payoff structure. 
He also indicates that bank market risk disclosures, which are typically Iinear characterizations of 
exposures, cannot sufficiently reflect nonlinear exposures. Also, accounting for different types of 
derivatives under different pronouncements (e.g., forward under SFAS 52, interest rate swaps 
under EITF 84-36) resulted in differences in hedge accounting for different types of derivatives 
prior to SFAS 133. Therefore, the effect of SFAS 133 could vary based on the type of derivative. 
In additional sensitivity analysis, we examine whether a particular type of interest rate derivative 
drives our results. We re-estimate the model presented in Equation (1) only for interest rate 
derivatives while allowing the coefficients on (linear) swaps and (non-linear) options to vary. 

Since most of the derivatives activity in the U.S. is dominated by the largest 25 banks, we 
allow the coefficients on the derivative exposure variables to vary across the largest 25 banks and 
the remaining banks in order to control for the potential difference in risk relevance of information 
on derivative exposures for these two groups of banks. As hypothesized in H1, we expect more 
negative coefficients on the accounting measures of exposures from derivatives classified as hedg- 
ing (Le., Во + a, B4- Bs, Bg + Ву, and Вз + Bo for the largest 25 and В», B4, Bs, and Вз for the 
remaining banks) after the adoption of SFAS 133. Also, as discussed before, whether trading 
derivatives are positively or negatively related to bond spreads is an empirical question. However, 
as hypothesized in H2, we expect the coefficients on the accounting measures of exposures from 
derivatives classified as trading (1.е., Bjo+ 811, Вг + Вз, B14 + Віз» and В + Віт for the largest 
25, and В.о, Во, B14, and Вів for the remaining banks) to be more negative (or less positive) after 
SFAS 133. 

Given our researcb questions, our study naturally focuses on the subsample of banks tbat use 
derivatives. To the extent that the characteristics of derivative-user banks systematically differ 
from those of non-derivative-user banks, an empirical model that ignores the non-randomness of 
this subsample likely suffers from selection bias. We control for the potential effects of selection 
bias by using a standard extension of the Heckman two-stage procedure generalized to polychoto- 
mous choices developed by Lee (1983), where the bond-pricing model presented in Equation (1) 
constitutes the second stage. In the first stage, we classify banks into three categories: banks 
without derivatives (Nonuser), banks with only hedging derivatives (Hedger), and banks with both 
hedging and trading derivatives (Trader). Because Trader banks are typically those that engage in 
derivatives activities extensively, the last two categories jointly control for the extent of derivative 
use and the type of activity (i.e., hedging versus trading). We include the Inverse Mills ratios (111) 
obtained from the multinomial logit model in the bond-pricing model. 

Finance theory posits that banks with smaller asset portfolio size (Size), lower leverage (Lev), 
lower interest rate risk (Gap), lower non-performing loans (Мр, less convex tax functions (Con- 
vexity), higher net interest margin (Nim), and stronger managerial incentives to take risk (Incen- 
tive) are less likely to use derivatives for hedging purposes. Prior finance studies also suggest that 
the likelihood of banks to use derivatives for trading purposes increases with Size, liquidity 
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(Liquidity), the size of the commercial and industrial loan portfolio relative to total assets (Co- 
тіпаї дап), and decreases with Lev and Nim. We specify Hedger as the base category in the 
following first-stage multinomial logit model: 


Dy(NonuserlHedger) = a + aBig? + ajSize; + амер, + адбару + asNim, + аб МР, 
+ a,Liquidity,,+ agincentive; + os Convexity,, + азуСотїтаГ ап, 
+ Ey (2a) 


D,,(Trader/ Hedger) = бу + 0,Big? + 6Віз > + @,Size;, + O,Lev;, + OsGap;, + Мт + В МР, 
+ OLiquidity;, + OIncentive, + 9оСопиехиух + 0j) ComIndLoan;, + £j, 
(2b) 


where, for bank i at time г, Dy equals 0 if bank і is a Hedger in both Equations (2a) and (2b), 
equals 1 in Equation (2a) if bank i is a Nonuser, and equals 1 in Equation (2b) if bank i is a Trader. 
Big” equals 1 if the observation belongs to one of the largest 25 banks in terms of asset size, and 
0 otherwise in Equation (2a). Big? (В:р“ =) equals 1 if the observation belongs to one of the 
largest three (four to 25) banks in terms of asset size, and 0 otherwise in Equation (2b). We 
discuss the rationale for each of the variables in Appendix B. 

Convexity serves as an exclusion restriction in our two-stage estimation procedure because, 
while both finance theory and empirical evidence suggest a direct link between tax convexity and 
derivative usage (Nance et al. 1993; Tufano 1996; Géczy et al. 1997; Graham and Rogers 2002), 
it is unlikely that tax convexity and bond spread are directly linked. 


Measurement of Bond Yield Spread 

Following Elton et al. (2001), we use a tax-adjusted measure of bond yield spread that 
eliminates distortions stemming from mismatches of duration and convexity between bank bonds 
and benchmark Treasury bonds. We present the details of this technique in Appendix C. 


Measurement of Derivative Exposure 

Ideally, a measure of derivative exposure disaggregates notional and fair values by long and 
short positions taken, class of instrument, time to maturity, degree of leverage, and level of 
complexity (Hull 1993; Wong 2000). However, neither SFAS 119 nor SFAS 133 requires disclo- 
sures detailed enough to obtain an ideal measure of derivative exposure. Wong (2000) suggests a 
comprehensive measure of derivative exposure within the constraints of the limited available 
disclosures. This measure includes the size of the portfolio exposed to market risk (measured as 
the notional value of long derivatives net of short derivatives) and the delta of the portfolio (i.e., 
the rate of change of the portfolio value with respect to a change in the underlying risk factor) 
because derivative exposure is driven by both these variables. 


№ Relevant studies from the finance literature are cited in Appendix B in the discussion of variables used in the multino- 
mial logit model. 

15 Equation (22) does not include separate, binary size variables for the largest three and largest four to 25 banks because 
none of the largest three banks is in the Nonuser or Hedger category. 

15 Imposing an exclusion restriction in the Heckman procedure is critical because the absence of such a restriction can lead 
to severe multicollinearity problems in the second-stage estimation (Francis et al. 2010). 

17 The tax and accounting literatures identify various settings where taxes are impounded in the prices of assets with 
tax-favored (or tax-disfavored) returns (Shackelford and Shevlin 2001). Holders of corporate bonds pay state and local 
taxes on interest payments, whereas holders of Treasury bonds do not. Therefore, the asymmetric tax treatment of 
corporate and Treasury bond interest payments likely results in corporate bonds offering higher pre-tax return to yield 
the same after-tax return as Treasury bonds. 
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Our measures of derivative exposure, which account for the complementary relation among 
notional amounts, fair values, and past changes in the underlying risk factors (e.g., interest rates), 
are based on Wong (2000). For hedging derivatives, we measure portfolio size as the net long 
notional amount (/rHdg and FxHdg) because banks typically disclose long and short positions in 
such derivatives when providing information about their hedging activities.'® However, more than 
half the sample banks with trading derivatives do not disclose long and short positions in such 
derivatives separately. For these banks, we estimate the extent of the net exposure to market risk 
using the technique employed by the Office of the Comptroller of the Currency.” We compute the 
deltas of the portfolio for each group of derivatives (DeltalrHdg, DeltaFxHdg, DeltalrTrd, and 
DeltaFxTrd) as the change in fair value of the derivative portfolio divided by the change in the 
underlying market risk factor (interest rate or foreign exchange rate) during the fiscal year 29?! 


Models for Testing Hypotheses 3 and 4 


The bond-pricing models provide evidence on how bond investors perceive and price ac- 
counting measures of derivative exposures before and after SFAS 133, but do not directly test 
whether and how SFAS 133 altered the risk-offsetting relation between derivative and non- 
derivative exposures. Studies that examine how derivative exposures covary with other exposures 
typically examine the association between fair value gains/losses from derivatives and fair value 
gains/losses from non-derivative exposures (e.g., Venkatachalam 1996; Liu et al. 2004). Following 
Liu et al. (2004), we (1) regress the quarterly change in the net fair value of hedging derivatives 
(ADerHedging) on the quarterly change in fair value of net non-derivative trading assets (ANon- 
DerTrading), and (2) regress the quarterly change in the net fair value of trading derivatives 
(ADerTrading) on the quarterly change in fair value of net non-trading assets (ANonTrading), 
respectively. A negative (positive) association of the dependent variable with ANonDerTrading 
and ANonTrading suggests that derivatives hedge (add to) the risks from non-derivative trading 
and non-trading exposures, respectively. We estimate the following models for the periods before 
and after SFAS 133: 


ADerHedgingy, = Yo + уу йн 8 + yANonDerTrading,, + уз Нів НН ж ANonDerTrading;, 
+ y4ANonTrading;, + у оће ж ANonTrading, + ец. (3) 


ADerTrading; = М + M High} + XANonDerTradingj, + АЗЫ ж ANonDerTradingy, 
+ МА МопТтадтр у + XHigh?" ж ANonTrading), + ец. (4) 


where High"@8 (High!'?) equals 1 if the notional amount of derivatives classified as hedging 
(trading) scaled by beginning book value of equity is above the sample median for the year, and 
0 otherwise. АП other variables are scaled by beginning book value of equity. We interact Highs 
and High™ with the fair value changes in non-derivative positions to control for potential differ- 
ences in the covariation of derivative and non-derivative exposures across banks that hedge more 


18 For the few sample banks that hold basis swaps, we add the notional values of the basis swaps to the absolute difference 
between the notional values of long and short derivatives. 

19 Tn its derivative reports, the Office of the Comptroller of the Currency estimates the extent of non-offset derivative 
exposure to risk by dividing the absolute value of the net fair value of the derivative portfolio by the absolute value of 
the gross fair value of the derivative portfolio that is exposed to that specific risk. 

20 The terms of the interest rates used in the calculation of interest rate derivative portfolio deltas are based on the weighted 
average maturity of interest rate derivatives estimated using Call Report data. Then we compute the yield on a synthetic 
Treasury bond with the same maturity using the parametric yield curve specification suggested by Gurkaynak et al. 
(2006). 

?! The changes in foreign exchange rates are based on the Nominal Broad Dollar Index released by the Federal Reserve. 
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versus less and banks that trade more versus less, respectively, because banks with larger portfo- 
lios of derivatives are more likely to be impacted by SFAS 133. As predicted in H3, we expect the 
coefficient on ANonDerTrading in Equation (3) (i.e., у for banks with less and у, + уз for banks 
with more hedging derivatives) to be more negative after SFAS 133 because we expect hedging 
derivatives to comprise more effective hedges on average after SFAS 133. We also expect the 
coefficient on ANonTrading in Equation (4) (i.e., X4 for banks with less and А, + As for banks with 
more trading derivatives) to be more negative (or less positive) as predicted in H4 because we 
expect SFAS 133 to increase the weight of derivative and non-derivative positions with offsetting 
exposures in the trading portfolio. 


IV. EVIDENCE 
Sample Selection 

The primary criteria for inclusion in the sample are use of derivative instruments and avail- 
ability of bond issue data. To ensure that financial data are available at the time of bond issuance, 
we match the Call Report (FR Y-9C) and 10-K data with the data for the first subordinated bond 
issue after the date of the 10-K filing. Consistent with the prior bond-pricing literature, our bond 
sample consists of subordinated bonds because subordinated bond spreads' greater sensitivity to 
credit and market risks facilitates more powerful tests of the relation between bank bond spread 
and accounting risk measures. We obtain bond issue data from the Mergent Fixed Income Secu- 
rities Database (FISD). 

We restrict our sample period to two years before and two years after SFAS 133 in order to 
mitigate the possibility that our results are driven by factors other than SFAS 133.2 Using the 
Bank Compustat Database, we first identify 710 banks including both national commercial banks 
(SIC 6021) and state commercial banks (SIC 6022) that were publicly traded between 1999 and 
2003. Using the Federal Reserve Bank of Chicago's Bank Regulatory Database, we identify 165 of 
these 710 banks as derivative users. We eliminate 24 of the 165 user banks due to lack of 
derivative disclosures in their 10-Ks, resulting in a final sample of 141 banks. We retain only one 
subordinated bond issue per bank in each year. These criteria yield a sample of 270 bond issues for 
the pre-SFAS 133 period and 265 bond issues for the post-SFAS 133 period. 


Descriptive Statistics 

We present descriptive statistics on the composition of sample bank derivative portfolios 
based on gross notional amounts in Panels A through D of Table 1. АП 141 sample banks hold 
hedging derivatives, whereas only 52 (67) banks hold trading derivatives before (after) SFAS 133. 
As expected, banks carry mostly interest rate and foreign exchange derivatives. As presented in 
Panel A (Panel C), interest rate derivatives constitute the largest portion of derivatives, comprising 
92.5 (67.4) percent and 88.3 (73.9) percent of hedging (trading) derivatives before and after SFAS 
133, respectively. The weight of foreign exchange derivative contracts is higher in trading deriva- 
tives compared to hedging derivatives because the former include derivatives sold to customers 
that manage foreign exchange risk exposure. Swaps are the primary component of interest rate 
derivatives classified as hedging and as trading, whereas forwards are most prevalent among 
foreign exchange derivatives (Panels B and D, respectively). 

Table 2 reports descriptive statistics for the dependent and independent variables employed in 
the bond-pricing tests. Tax-adjusted bond spread has a sample mean (median) of 70 (62) basis 





21 The tests for the pre-SFAS 133 (post-SFAS 133) period are based on financial statement data for fiscal years ending 
December 31, 1999 and 2000 (2002 and 2003). We exclude the year 2001, the first effective year of SFAS 133, because 
the annual change in fair values of hedging and trading derivatives from year 2000 to 2001 is distorted by reclassifi- 
cation of some derivatives into trading and hedging by SFAS 133. 
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TABLE 2 
Descriptive Statistics for Variables Employed in Bond-Pricing Tests 
Variables Mean Median Minimum Maximum 
Dependent Variable 
Tax-Adjusted Spread (96) 0.702 0.624 0.169 8.482 
Derivative Exposure Variables Based on Portfolio Size 
IrHdg 0.085 0.079 0.005 0.603 
FxHdg 0.001 0.000 0.000 0.068 
Ira 0.045 0.030 0.003 0.716 
FxTrd* 0.012 0.002 0.000 0.452 
Derivative Exposure Variables Based on Portfolio Delta 
DeltalrHdg 0.004 0.004 0.000 0.025 
DeltaFxHdg 0.001 0.000 0.000 0.004 
DeltalrTrd* 0.001 0.000 0.000 0.017 
DeltaFxTrd* 0.001 0.000 0.000 0.010 
Control Variables 
Maturity 11.785 10.000 1.000 30.000 
Amount 17.443 17.076 6.451 24.635 
Size 15.901 15.844 12.218 20.447 
Insured 0.411 0.479 0.232 0.867 
Estate 0.001 0.000 0.000 0.015 
Мр! 0.007 0.006 0.000 0.057 
Сар 0.181 0.152 0.000 0.689 
Коа 0.012 0.011 —0.061 0.077 
Сар 0.076 0.078 0.033 0.237 
NonDerTrd 0.042 0.019 0.000 0.266 
CrRisk 0.000 0.000 0.000 0.001 
LoanComm 0.064 0.056 0.000 0.465 
Guarantee 0.007 0.000 0.000 0.153 
CrDer 0.005 0.000 0.000 0.254 
Strip 0.001 0.000 0.000 0.021 


The sample represents 535 bank-year observations from 1999 to 2003 (excluding 2001) for 141 banks. 

All control variables are as defined in Appendix A. 

5 Statistics presented are based only on banks with trading derivatives. 

Variable Definitions: 

IrHdg (FxHdg) = absolute value of net long/short notional value of interest rate (foreign exchange) hedging 
derivatives scaled by total assets; 
IrTrd (FxTrd) = absolute value of net long/short notional value of interest rate (foreign exchange) trading 

derivatives scaled by total assets; 

DeltalrHdg (DeltaFxHdg) = portfolio delta of interest rate (foreign exchange) hedging derivatives; and 

DeltalrTrd (DeltaFxTrd) = portfolio delta of interest rate (foreign exchange) trading derivatives. 


points. The mean ratio of net notional amount of interest rate hedging derivatives to total assets 
(IrHdg) is 8.5 percent, whereas the mean ratio of gross notional amount of interest rate hedging 
derivatives to total assets (unreported) is 12.8 percent. For banks with trading derivatives, the 
mean ratio of net notional amount of interest rate trading derivatives to total assets (17774) is 4.5 
percent, whereas the mean ratio of gross notional amount of interest rate trading derivatives to 
total assets (unreported) is 163.4 percent. These statistics indicate that, although the volume of 
trading derivatives is significantly greater than the volume of hedging derivatives, the size of net 
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exposure from hedging derivatives (IrHdg) is greater, on average, than the size of net exposure 
from trading derivatives (JrTrd), suggesting that banks take offsetting positions in trading deriva- 
tives to a great extent. 

Unreported correlation analyses generally yield insignificant correlations between tax- 
adjusted spread and the variables of interest. One exception is JrHdg, which is negatively corre- 
lated with tax-adjusted spread both before (г = —0.102; p = 0.094) and after SFAS 133 
(r = —0.123; p = 0.046). Also, the insignificant correlations (unreported) among the variables of 
interest suggest that multicollinearity between these variables of interest is unlikely to significantly 
affect our estimation results. 


Results of the First-Stage Multinomial Logit Model 


Table 3 contains results of the multinomial logit regressions for the Nonuser and the Trader 
categories. The results indicate that Nonuser, Hedger, and Trader banks systematically differ. 
Specifically, the results of the multinomial logit regression for Nonuser status indicate that banks 
less likely to use derivatives have a smaller asset portfolio (Size), lower probability of financial 
distress (Lev), lower interest rate risk exposure (Gap), lower credit risk (Мрі), executive compen- 
sation plans with stronger incentives to take risk (Incentive), and less convex tax functions (Con- 
vexity), relative to Hedger banks. The results of the multinomial logit regression for Trader status 
indicate that banks more likely to use derivatives.to a large extent and engage in derivatives 
trading have a larger asset portfolio (Size), lower probability of financial distress (Lev), lower net 
interest margin (Nim), and larger portfolio of commercial and industrial loans (ComIndLoan), 


TABLE 3 
Multinomial Logit Results 
Nonuser/Hedger Trader/Hedger 
Predicted Sign Coefficient p-value Predicted Sign Coefficient | p-value 

Big? = -0.099 0.133 

Big? + 0.211 0.008 
Bigt + 0.135 0.002 
Size = —0.418 <0.001 + 0.365 0.004 
Lev = -0.007 0.003 = —0.005 0.025 
Gap 25 —0.400 0.016 —/+ 0.246 0.240 
Nim —/+ —3.079 0.007 = —1.911 0.060 
Мрі = —1.284 0.002 —/+ 0.921 0.084 
Liquidity —/+ 0.609 0.247 + 0.035 0.162 
Incentive + 0.048 <0.001 —/+ 0.033 0.181 
Convexity - -0.005 0.053 —/+ 0.002 0.014 
ComIndLoan —/+ 0.152 0.042 + 0.542 0.072 
Log Likelihood — 196.224 -368.357 


The sample represents observations from 1999 to 2003 (excluding 2001). 


The sample includes 710 publicly traded banks, 141 of which are derivative users. The tests are based on 10,920 bank- 
quarters: 8,720 observations belong to Nenuser banks, 1,268 observations belong to Hedger banks, and 932 observations 
belong to Trader banks. 

All variables are as defined in Appendix B. 
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relative to Hedger banks. These findings are in line with risk management theory and the findings 
of prior empirical studies.” 


Results of Tests of Hypotheses 1 and 2 


The data set used in this study contains observations on multiple banks across multiple years. 
To estimate standard errors that are robust to both time-series and cross-sectional correlation, all 
results are based on standard errors clustered simultaneously by bank and by year. 

Table 4, Panel A reports estimation results for the model presented in Equation (1). The 
coefficient signs on the control variables are generally consistent with those reported in prior 
studies. Іті, the Inverse Mills ratio obtained from the multinomial logit model, is statistically 
different from zero both before and after SFAS 133 (p « 0.010 and p « 0.002, respectively), 
underscoring the necessity to control for potential self-selection bias. 

We find negative and significant associations between tax-adjusted spread and both account- 
ing measures of exposure from interest rate hedging derivatives, IrHdg and DeltalrHdg, before 
and after SEAS 133." These findings are consistent with such derivatives being used for hedging 
purposes and the complementary relation between derivative notional amounts and fair values in 
capturing derivative exposures as pointed out by Wong (2000). To test НІ and H2, we test the 
equality of coefficients on derivative exposure measures across pre- and post-SFAS 133 periods 
for the largest 25 and the remaining banks. The results of these tests are presented in Table 4, 
Panel B. For both subsamples, as hypothesized in H1, the associations of tax-adjusted bond spread 
with IrHdg and DeltalrHdg are more negative after SFAS 133 (р = 0.018 and р = 0.015 for the 
largest 25 banks; p = 0.001 and p = 0.032 for the remaining banks, respectively). These findings 
are consistent with derivative hedges that are more effective, on average, being classified as 
hedging after SFAS 133. 

The insignificant coefficients on the interactions of IrHdg, DeltalrHdg, FxHdg, and DeltaFx- 
Нав with Big? before and after SFAS 133 (presented in Table 4, Panel A) suggest that the pricing 
of hedging derivatives in bond markets does not differ between the largest 25 banks and the 
remaining banks.” 

The results in Table 4, Panel B also show that the associations between tax-adjusted spread 
and both Irfrd and DeltalrTrd are more negative after SFAS 133 for the largest 25 banks 
(p = 0.018 and p = 0.019, respectively). Regarding the two foreign exchange derivative exposure 
measures (FxTrd and DeltaFxTrd), we observe a similar pattern only for FxTrd (p = 0.089). The 
negative coefficients on IrTrd, DeltalrTrd, and FxTrd are consistent with bond markets, on aver- 
age, viewing trading derivatives as being used to cross-hedge other risk exposures. The findings of 
greater negative coefficients on Їг 4, DeltalrTrd, and FxTrd following SFAS 133 for the large 
banks are consistent with more economic hedges being classified as trading after SFAS 133 by 
these banks. These findings generally support H2. However, for the remaining banks, we find no 
support for H2, perhaps because these relatively smaller banks do not have much involvement in 
or exposures from trading derivatives. Also, except for supporting H2 for the largest 25 banks, H1 
and H2 are not supported for foreign exchange rate derivatives, perhaps because banks' exposures 


23 The multinomial logit regression for Nonuser (Trader) status correctly classifies 89 percent (77 percent) of banks. 

24 When tax-adjusted bond spreads are decomposed into systematic risk and default risk premium components as sug- 
gested by Elton et al. (2001), we find that the negative association with spread is most pronounced for the systematic 
risk premium component, whereas the magnitude of the negative association with default risk premium is modest. The 
‘results of these tests are excluded for brevity. 

25 The negative and positive associations of tax-adjusted spread with accounting measures of derivative exposures and 
Gap, respectively, are in line with the findings of Schrand (1997) who documents that maturity gap (non-trading interest 
rate risk) is positively associated with higher perceived risk, and hedging derivatives decrease the maturity gap, thus 
reducing perceived risk. 
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from foreign exchange rate derivatives are not material enough to affect their bond spreads. 

Among the variables that proxy for exposures from off-balance sheet instruments (other than 
derivatives) that are potentially used for hedging, LoanComm and Strip are negatively and signifi- 
cantly related to tax-adjusted bond spread before and after SEAS 133 (р = 0.078 and р = 0.080 
for LoanComm; p = 0.054 and p = 0.090 for Strip, respectively). CrDer is also negatively related 
to tax-adjusted spread, but only prior to SFAS 133 (р = 0.075). 

The results reported Table 4, Panel A indicate that the association between bond spread and 
accounting measures of derivative exposures is driven mainly by interest rate derivatives. In 
unreported tests, we find support for both H1 and H2 based on different types of interest rate 
derivative instruments (Le., interest rate swaps and interest rate options). 

An examination of sample bank 10-Ks and Call Reports reveal that 97 of the 141 sample 
banks had to transfer derivatives previously classified as hedging to trading because of the various 
provisions of SFAS 133, indicating that SFAS 133 had direct effects on the classification of 
derivatives for a majority of the sample banks. Our hypotheses still hold for the banks that did not 
have to reclassify any derivatives. The more stringent criteria imposed by SFAS 133 (rather than 
the relatively lax criteria before SFAS 133) assure that derivatives classified as hedging in the 
post-SFAS 133 period are, in fact, highly effective hedging derivatives. In addition, as argued by 
the FASB (SFAS 133, 4 238), SFAS 133’s detailed documentation requirements likely improve the 
monitoring of hedging activities and result in less uncertainty in the financial statements, leading 
to Jower bank bond spreads for all derivative-user banks. Overall, these results support the view 
that banks benefit from the more credible derivative-related information provided under SFAS 
133. 


Economic Effects of SFAS 133 on Bond Spreads 

The findings from the bond-pricing tests indicate that the association between the accounting 
measures of derivative exposures and bond spread is driven mainly by interest rate derivatives. To 
provide some indication of the economic effects of interest rate derivatives before and after SEAS 
133, we estimate their effects on bond spread using the estimated coefficients reported in Panel A 
of Table 4 and descriptive statistics reported in Table 2. All estimates for the periods before and 
after SFAS 133 are based on mean observations for the entire sample period. 

Our results indicate that banks with average values of IrHdg and DeltalrHdg have bond 
spreads that are 19.5 basis points lower in the pre-SFAS 133 period and 30.8 basis points lower in 
the post-SFAS 133 period than banks with no exposure from interest rete hedging derivatives.” 
Similarly, banks with average values of IrTrd and DeltalrTrd have bond spreads that are 5.8 basis 
points lower ш the pre- and 9.9 basis points lower in the post-SFAS 133 period than banks with 
no exposure from interest rate trading derivatives.” 


Results of Tests of Hypotheses 3 and 4 


Panels A through D of Table 5 present post-SFAS 133 and pre-SFAS 133 Spearman correla- 
tions (above and below the diagonal, respectively) for the variables used in the exposure co- 
variation tests. nee we partition the entire sample into banks with large and small portfolios of 
hedging derivatives. Panels A and B display correlations among the variables used to estimate 


26 We use the weighted average of the coefficients for the largest 25 banks and other banks when estimating the effects of 
hedging derivatives because the effect oi hedging derivatives on bond spread is significantly negative for both groups of 
banks. 


27 We estimate the effects of trading derivatives only for the largest 25 banks because our results indicate that the effect of 
в meding derivatives on bond spread is significantly negative for only these banks. 
28 We consider a bank to have a large (small) portfolio of hedging derivatives if the notional amount of hedging derivatives 
scaled by beginning book value of equity is above (below) the sample median in the pre-SFAS 133 period. 
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Equation (3). As presented in Panel A, for high-hedger banks, the correlation between ADerHedg- 
ing and ANonTrading is more negative after SEAS 133 (i.e., it changes from —0.227 to —0.284), 
providing preliminary evidence in support of H3. Although not as sizable, the change in the 
correlation coefficient (from —0.155 to —0.175) is in the same direction for banks with small 
portfolios of hedging derivatives as presented in Panel B. 

Panels C and D of Table 5 display correlations among the variables used in estimating 
Equation (4) for banks with large and small portfolios of trading derivatives, respectively. The 
negative correlations between ADerTrading and ANonDerTrading presented in Panels C and D 
provide evidence in support of trading derivatives offsetting exposures from non-derivative trad- 
ing positions. For high-trader banks, which primarily consist of large banks, the correlation be- 
tween ADerTrading and ANonDerTrading is more negative after the adoption of SFAS 133 
(r = —0.128; p = 0.039) compared to the pre-SFAS 133 period (r = —0.091;p = 0.196), 
consistent with more economic hedges being classified as trading after SFAS 133. A similar trend 
also exists for low-trader banks. However, the correlation coefficients are smaller and insignificant 
(r = —0.056; p = 0.426 and r = —0.073; p = 0.238 before and after SFAS 133, respectively). 

The insignificant correlation coefficients between ADerHedging and ANonDerTrading 
(ADerTrading and ANonTrading) observed in Panels A through D indicate that banks do not 
cross-hedge trading (non-trading) positions with hedging (trading) derivatives. These findings are 
consistent with Liu et al. (2004), who find no significant cross-hedging between trading exposures 
and non-trading exposures. 

The results from estimating Equation (3) that are reported in Panel A of Table 6 are mostly 
consistent with the correlation analyses. Panel C presents the results of the tests of H3 for banks 
with large and small portfolios of hedging derivatives. The coefficient on ANonTrading is more 
negative after SFAS 133 for both groups of banks (p — 0.030 and p — 0.078, respectively), 
providing further evidence that derivatives classified as hedging consist of hedges that meet 
stricter hedge effectiveness criteria after SFAS 133. 

Results from estimating Equation (4) are reported in Panel B of Table 6. Panel C presents the 
results of the tests of H4 for banks with large and small portfolios of trading derivatives. The 
negative and significant coefficient on ANonDerTrading for both groups of banks before and after 
SFAS 133 suggests that trading derivatives cross-hedge non-derivative trading exposures. Also, 
the coefficient on ANonDerTrading is more negative subsequent to SFAS 133 for banks with large 
portfolios of trading derivatives (p — 0.054), providing support for H4. More specifically, this 
finding suggests that the extent of cross-hedging between trading derivatives and non-derivative 
trading positions is greater after the adoption of SFAS 133, consistent with more economic hedges 
that do not qualify as accounting hedges being classified as trading subsequent to SFAS 133. 
However, for banks with small portfolios of trading derivatives, the coefficients on ANonDerTrad- 
ing are not reliably different before and after SFAS 133 (p — 0.151). 


Sensitivity Analyses and Additional Considerations 

We conduct several sensitivity analyses to assess the robustness of our findings. First, we 
re-run our tests after deleting the three largest banks (J.P. Morgan Chase, Bank of America, and 
Citibank) that dominate the derivatives market during our sample period. We continue to find 
support for all hypotheses. Second, a potential explanation for our findings is that the change in the 
yield curve coincides with the adoption of SFAS 133. The flat yield curve during the pre-SFAS 
133 period (1999-2000) became steeply upward sloped in 2001. It remained so through the end of 
2004 and flattened in the 2005—2006 period. In order to address the possibility that our results are 
driven by the different-shaped yield curves in the pre- and post-SFAS 133 periods rather than by 
SFAS 133, we re-test our hypotheses using the years 2005-2006 as the post-SFAS 133 period. 
This specification ensures that both pre- and post-SFAS 133 period yield curves are flat. We 
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continue to find strong support for cur hypotheses, suggesting that the change in the yield curve in 
2001 is not driving our results. Third, as mentioned earlier, we estimate the size of the net 
long/short positions in trading derivatives for some banks. These estimates are likely to contain 
error. Therefore, we re-run our tests using gross (long plus short) notional values for trading 
derivatives. Our results regarding the effect of SFAS 133 still hold. Fourth, we repeat our tests 
after excluding the top and bottom 1 percent of the distribution of each variable to assess the 
sensitivity of our findings to extreme observations. We continue to observe results consistent with 
our primary findings. Last, we find similar results for the exposure covariation tests of H3 and H4 
when we scale all variables by beginning total assets instead of beginning book value of equity as 
we do in our primary tests. 

Another potential alternative explanation for the difference between the risk relevance of 
derivatives before and after SFAS 133 is the possibility of a change in banks’ derivative strategies. 
However, from a cost-benefit standpoint, we believe that there are a number of reasons for banks 
not to change their derivative strategies following SFAS 133. First, derivatives are efficient and 
low-cost risk management tools for banks. In the absence of derivatives, managing interest rate 
risk would require significant amounts of capital investment. In addition, derivative contracts can 
be customized, depending on the risk being hedged. Thus, the benefits of managing interest rate 
tisk via efficient and low-cost risk management tools likely outweigh the costs of using alternative 
hedging strategies. Second, a major concern about SFAS 133 expressed by banks is the increased 
volatility likely to be introduced by SEAS 133 to banks’ earnings. However, banks can poten- 
tially offset this volatility in a cost-effective way by using other tools for smoothing income (such 
as loan loss provisions) without changing their derivative strategies (Wahlen 1994; Liu and Ryan 
2006; Kanagaretnam et al. 2010). In fact, Kilic et al. (2010) document that banks that are likely to 
be most impacted by SFAS 133 have increased their reliance on loan loss provisions for income 
smoothing in the post-SFAS 133 period. 

Finally, we note that, although the volume of trading derivatives grew rapidly during our 
sample period,” this rapid growth is unlikely to drive our results regarding the effects of SFAS 
133 for a number of reasons. First, our hypotheses are stated and models designed to test whether 
the pricing of a unit of derivative exposure changed with SFAS 133, regardless of the amount of 
total exposure. Second, the growth trend cannot be generalized to all banks; it is driven mostly by 
the largest seven derivative-dealer banks (Office of the Comptroller of the Currency Report on 
Derivatives Activities 1995-2006). We continue to find support for all our hypotheses when we 
re-run our tests after deleting these seven banks from the sample. Third, the growth trend started 
in the early 1990s and has continued since then. Thus, there is no evidence of a shift in the trend 
that coincides with the adoption of SFAS 133. 


Ү, SUMMARY AND CONCLUSION 
SFAS 133 is a widely debated and controversial accounting standard that replaced the incom- 
plete and inconsistent accounting guidance for derivatives that existed prior to its adoption. We 
examine how SFAS 133 affected the risk relevance of accounting measures of bank derivative 
exposures to bond investors. We hypothesize and find that interest rate hedging derivatives are 
more negatively associated with bond spreads after SFAS 133. These findings are consistent with 


2 SFAS 133 was expected to introduce more volatility to banks’ earnings because it requires more economic hedges to be 
classified as trading with gains/losses recognized in income immediately and any hedge ineffectiveness to be recognized 
in income immediately 

30 The annual growth rates of derivatives classified as trading are 35.71 percent in 1996, 29.46 percent in 1997, 53.65 
percent in 1998, 3.25 percent in 1999, 17.88 percent in 2000, 11.92 percent in 2001 (the year of adoption of SFAS 133), 
21.32 percent in 2002, 28.76 percent in 2003, 28.61 percent in 2004, 5.38 percent in 2005, and 11.52 percent in 2006. 
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SFAS 133 restricting derivatives classified as hedging to only those instruments that serve as 
highly effective and documented derivative hedges. For the largest 25 banks, which are exten- 
sively involved in derivatives trading, we find that interest and foreign exchange rate trading 
derivatives are more negatively associated with bond spreads after SFAS 133, consistent with 
more economic hedges being classified as trading after SFAS 133. To provide further insights, we 
also examine how SFAS 133 altered the association of exposures from derivatives with non- 
derivative exposures. Consistent with our results from the risk-relevance tests, the findings from 
the exposure covariation tests indicate that the offsetting effects between hedging derivatives and 
non-trading exposures, and between trading derivatives and non-derivative trading exposures, are 
greater after SFAS 133. Taken together, these findings indicate that, despite its limitations, SFAS 
133 has improved the transparency and the monitoring of the risk implications of derivatives. 


APPENDIX A 
CONTROL VARIABLES USED IN THE BOND-PRICING MODELS 
Variable Name Data Source Definition and Explanation 
Bond Characteristics 
Maturity Mergent FISD Natural logarithm of the bond’s years to maturity. Bonds 


with longer maturities (MATURITY) are expected to have 
higher spreads due to lower liquidity (Hancock and 
Kwast 2001; Covitz et al. 2000). 

Amount Mergent FISD Natural logarithm of the size of the bond issue in 
millions of dollars. Bonds issued in larger amounts are 
expected to have lower spreads due to higher liquidity 
benefits (Hancock and Kwast 2001; Morgan and Stiroh 
2001) and due to economies of scale in underwriting 
(Sengupta 1998). 

Bank Characteristics 

Size FR 9-YC Natural logarithm of the bank’s total assets. Larger banks 
are less likely to default because they are better 
diversified and are more likely to benefit from the 
“too-big-to-fail” policy formalized by the Comptroller of 
the Currency (Flannery and Sorescu 1996; Morgan and 
Stiroh 2001). Also, their bonds are more liquid and, 
therefore, are traded at lower spreads over Treasury 
bonds (Flannery and Sorescu 1996). 

Insured Bank Compustat Ratio of total insured deposits to total deposits. Banks 
with larger insured deposits are subject to less market 
monitoring and discipline (ВШец et al. 1998) and, 
therefore, shift to greater use of insured deposits as their 
financial condition deteriorates (Billett et al. 1998; 
Jordan 2000). As a result, banks with higher insured 
deposits are perceived to have greater potential for moral 
hazard and, therefore, higher bond spreads (Jagtiani et al. 
2002). 

Estate FR 9-YC Real estate acquired in satisfaction of debt previously 
contracted scaled by total assets. Creditors may view 
higher property collected on foreclosed real estate loans 
as a signal of lower loan quality and higher credit risk, 
and may demand higher bond spread (Morgan and Stiroh 
2001). 
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Variable Name Data Source 
Мрі FR 9-YC 
Gap 10-K 
NonDerTrd 10-K 
Cap FR Y9-C 
Roa FR Y9-C 
CrRisk FR Y9-C 
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Definition and Explanation 


Sum of nonaccrual loans and accruing loans past due 90 
days or more scaled by total assets. Higher nonaccrual 
and past due loans signal lower loan quality and higher 
loan losses. Higher loan losses reduce the capital base 
more and require greater charges against income through 
higher provisions for loan losses, thereby increasing bank 
default risk (Flannery and Sorescu 1996; Morgan and 
Stiroh 2001). 

Absolute value of the difference between bank 
non-trading assets and non-trading liabilities maturing or 
repricing within one year scaled by total assets. Higher 
maturity mismatch between non-trading assets and 
liabilities increases a bank’s non-trading interest rate risk 
exposure and overall risk (Flannery and James 1984; 
Flannery and Sorescu 1996; Morgan and Stiroh 2001). 
Fair value of non-derivative trading assets scaled by total 
assets. Trading activities are associated with higher bond 
spreads due to potential agency risks associated with 
such activities. That is, banks may more easily assume. 
trading risks at levels that are suboptimal for 
bondholders, since trading positions can be easily 
changed or concealed with leverage. Such uncertainty 
about trading activities gives rise to agency risks 
(Morgan and Stiroh 2001). 

Ratio of book value of equity to risk-weighted assets. A 
bank’s assets differ in terms of riskiness. Some assets 
(e.g., commercial loans) are considered riskier than 
others (e.g., real estate loans), and are given higher risk 
weights in the calculation of regulatory capital. Bank 
capital is a cushion for potential losses. Thus, Cap will 
be lower for banks with riskier asse:s. Cap jointly 
captures the asset risk composition and capital adequacy. 
Higher capital is likely to result in creditors demanding 
lower spread. 

Annual net income scaled by total assets. Higher 
profitability is associated with lower likelihood of default 
and bond spread (Flannery and Sorescu 1996; Morgan 
and Stiroh 2001). 

Charge-offs incurred on derivative contracts during the 
year (i.e., losses incurred when the counterparty fails to 
meet the settlement obligations of the derivative contract) 
scaled by total assets. Our derivative exposure measures 
proxy market risk exposure. However, derivatives also 
expose the bank to credit risk. The charge-offs incurred 
on derivatives are expected to increase bond spread. 


Off-Balance Sheet Hedges other than Market Risk Derivatives 


LoanComm 10-K 


The Accounting Review 


Amount of loans for which official promises to lend are 
conveyed scaled by total assets. Banks sometimes use 
forward loan sales commitments as economic hedges of 
commitments to originate loans (Office of the 
Comptroller of the Currency 2005) because fair values of 
these commitments are offsetting. 


(continued on next page) 


May 2011 
American Accounting Association 


798 


Variable Name Data Source 
Guarantee 10-K 
CrDer 10-K 
Strip 10-K 
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Definition and Explanation 


On- and off-balance sheet guarantees and insurance 
contracts that back financial obligations scaled by total 
assets. Banks enter into financial guarantee and insurance 
contracts that back off-balance sheet instruments or 
financial obligations such as outstanding securities and 
loans. 

Notional amount of credit derivatives on which the bank 
is the beneficiary scaled by total assets. In order to hedge 
their credit risks (losses) from the loans they originate, 
banks sometimes enter into credit derivative contracts 
with the bank as the beneficiary. Such contracts 
constitute guarantees against a bank’s credit losses from 
other parties and are expected to reduce bank credit risk 
and thus bond spread. 

Fair value of principal-only strips scaled by total assets. 
Principal-only strips do not qualify as derivative 
instruments under the requirements of SFAS 133 
(paragraph 14), but these instruments are used to hedge 
prepayment risk especially from mortgage servicing 
rights, which is driven by decreasing interest rates. 


APPENDIX B 


VARIABLES USED IN THE FIRST-STAGE MULTINOMIAL LOGIT MODEL 


Variable Name Data Source 
Big? FR Y-9C 
Віза (Веру >) FR Y-9C 
Size FR Y-9C 
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Definition and Explanation 


Dummy variable that equals 1 if the bank is one of 
largest 25 banks in the United States, and 0 otherwise. 
Holdings of derivatives in the United States are 
concentrated in the largest 25 banks, whereas the 
remaining holdings (particularly for hedging purposes) 
are still economically significant for the remaining user 
banks because their asset portfolios are remarkably 
smaller than those of the largest 25 banks. In order to 
better capture the effect of large banks, we include 
discrete size variables. 

Dummy variable that equals 1 if the bank is one of 
largest three (4 to 25) banks in the United States, and 0 
otherwise. Among the largest 25 banks, the largest three 
banks (J.P. Morgan Chase, Bank of America, and 
Citigroup) dominate derivatives activity (Office of the 
Comptroller of the Currency Derivatives Report, 
1998-2003). 

Natural logarithm of total assets. A number of studies 
(e.g., Booth et al. 1984; Koppenhaver 1990; Kim and 
Koppenhaver 1992; Gunther and Siems 2002) document 
that bank size is positively related to derivative use 
because large banks can more easily bear costs 
associated with implementing derivative trading and 
hedging programs such аз the costs of personnel training 
and internal control systems. 
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Variable Name 
Lev 


Урі 


Incentive 
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Data Source 
FR Y-9C 


FR Y-9C 


FR Y-9C 


FR Y-9C 


FR Y-9C 


ExecuComp 


Definition and Explanation 


Ratio of risk-weighted assets to book value of equity. 


Lev jointly captures the probability of financial distress 
(Nance et al. 1993; Géczy et al. 1997; Sinkey and Carter 
2000) and balance sheet composition in terms of asset 
riskiness. Banks with higher leverage have higher 
probability of financial distress, and are more likely to 
use derivatives to reduce the variance of bank value. 
Also, banks holding assets with higher credit risks (i.e., 
higher risk-weighted assets and therefore higher Lev) are 
more likely to use derivatives to hedge other exposures 
(Schrand and Unal 1998). On the other hand, a bank 
with low capital may lack credibility as counterparty and 
creates credit risk for the opposite party in derivatives 
trading. Therefore, the likelihood of holding trading 
derivatives is expected to decrease with Lev (Kim and 
Koppenhaver 1992). 

Absolute value of the difference between bank assets and 
liabilities maturing or repricing within one year scaled by 
total assets. Banks with greater maturity mismatch and, 
therefore, greater exposure to interest rate risk are more 
likely to use derivatives to hedge their exposures (Sinkey 
and Carter 2000; Gunther and Siems 2002). 

Net interest income scaled by total assets. Banks with 
lower net interest margins are more likely to use 
derivatives, particularly for trading purposes, in an 
attempt to increase fee income by selling derivatives or 
by speculating in derivative markets to offset reduced 
spreads from traditional banking activities (Shyu and 
Reichert 2002). 

Ratio of the sum of nonaccrual loans and accruing loans 
past due 90 days or more to total assets. Lower loan 
quality is associated with higher credit risk. Às part of a 
joint-risk management program, banks exposed to 
significant levels of credit risk are more likely to use 
derivatives to hedge other exposures (Schrand and Unal 
1998). 

Sum of cash and due, federal funds sold and securities 
purchased to resell scaled by total assets. Banks with 
greater liquidity are in a better position to manage large 
derivative positions (Jordan 1995) and are more likely to 
engage in derivatives trading activities. 

Ratio of option vega to delta. Risk management theory 
identifies risk-taking incentives of managers as a 
determinant of risk management (Stulz 1984; Smith and 
Stulz 1985). Following Core and Guay (2002), we 
measure manager incentives to increase risk (stock price) 
by vega (delta), and include Incentive to capture the 
effect of managerial risk-taking incentives on the 
decision to use derivatives (Tufano 1996; Rogers 2002). 
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Variable Name Data Source Definition and Explanation 
Convexity Bank Expected percentage tax savings from a 5 percent 
Compustat reduction in the volatility of taxable income. In the 


presence of tax function convexity, income smoothing 
reduces expected tax liabilities (Smith and Stulz 1985). 
Hedging serves as a mechanism to smooth income over 
time. Therefore, a nurnber of studies (e.g., Nance et al. 
1993; Dolde 1995; Mian 1996; Tufano 1996; Géczy et 
al. 1997; Graham and Rogers 2002) control for tax 
convexity as a determinant of derivative usage for 
hedging purposes. We measure tax convexity following 
Graham and Smith (1999). 

ComIndLoan FR Y-9C Ratio of commercial and industrial loans to total assets. 
Commercial and industrial loan customers use derivatives 
to hedge their risks, and banks’ familiarity with potential 
derivative customers increase the likelihood and extent of 
their participation in derivatives market as a dealer (Kim 
and Koppenhaver 1992). 


APPENDIX C 
DERIVATIONS OF UNADJUSTED SPREAD AND 
TAX-ADJUSTED SPREAD 
Unadjusted Spread 

We compute unadjusted bond yield spreads as the difference between the actual yield to 
maturity on the bank bond at issuance, and the yield to maturity on a synthetic Treasury bond, г, 
that perfectly matches the bank bond in terms of coupon payment, maturity, duration, and con- 
vexity. 

In order to obtain r,, we first treat each payment made by a coupon-bearing bank bond as a 
zero-coupon bond. For each of these payments, we calculate the risk-free spot rate using a para- 
metric yield curve specification suggested by Gurkaynak et al. (2006). This specification, initially 
proposed by Nelson and Siegel (1987) and extended by Gurkaynak et al. (2006), is as follows: 


rn) = 8+ а = +a LM EC - +, EC -&). (Ал) 
т т т 


where ғ, (л) is the yield to maturity of an n-year zero-coupon risk-free (Treasury) bond at time 5 
n is number of years till maturity; 7, and 7, are parameters that determine the locations of the 
“humps” in the yield curve. The parameters of this specification from 1961 to 2006 are provided 
on a daily basis by Gurkaynak et al. (2006).3! Thus, for a given date during the 1961-2006 period, 
yields can be computed for future bond payments, each of which is assumed to be a zero-coupon 
bond, at all maturities. Next, we discount each payment made by the bond at a rate derived from 
Equation (A.1), and sum up the discounted values to obtain the price of a Treasury bond, which 
perfectly matches the bank bond in terms of coupon payment, maturity, duration, and convexity. 
The Treasury bond price, РІЗ, is computed аз: 


3! The parameters are also available at http.//www.federalreserve.gov/pubs/feds/2006. 
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Cero! (ler)? "U Aer Qn" 
where r, is the vield derived from Equation (A.1) for a zero-coupon bond that matures in n years; 


C is the coupon payment; and F is the face value of the bond. In order to obtain the appropriate 
risk-free spot rate, we then solve the following equation for r,: 


TR (A.2) 


C C C F 


TR Lá , 
Gtr) (рагу (1-2 Ger 





(А.3) 


Tax-Adjusted Spread (Spread less Tax Premium) 

In order to obtain a measure of tax-adjusted spread, ry, we reduce total spread, r, by the 
amount of estimated tax premium, r, We estimate r, as the difference between the portion of 
spread explained by expected losses from default and taxes, and the portion explained by default 
premium only. Re-arranging the equations in Elton et al. (2001) yields: 


D 
пем ze. (А.4) 
where D = (1 ~ Роз) + fret, and T= EC d - ty), and г, is the tax premium; t, is 
the state tax rate; 1, is the federal tax rate; and other terms are as defined in Equations (A.2) and 
(A.3). Spread less tax premium, ry, then equals the difference between г, and г, We assume an 
effective tax rate, t,(1— і), of 4.2 percent based on the grid search procedure in Elton et al. 
(2001). We examine different levels of tax rates between 3 percent and 5 percent at increments of 
02 percent.” An effective tax rate of 4.2 percent minimizes the squared mean error between 
derived and actual sample bond prices. This effective tax rate is consistent with pairs of federal 
and state tax rates such as 33 percent and 6.3 percent, respectively. 
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Supply Chains and Segment Profitability: 
How Input Pricing Creates a Latent 
Cross-Segment Subsidy 
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ABSTRACT: Recent years have seen an Increased emphasis on developing more 
precise accounting measures of market- and customer-level profitability. By their very 
nature, such segment profitability measures may unwittingly neglect complementarities. 
Such сотрїететаг өв have been widely recognized on the demand side due to brand 
recognition, predatory pricing, or interdependent products. We develop a model show- 
ing that important supply-side complementarities can also be prevalent. In particular, 
when a firm relies on a self-Interested supplier for Inputs used across multiple seg- 
ments, the wholesale price it pays depends on the average profitabillty of its segments. 
Taking such Interaction between upstream pricing and the firm's downstream reach into 
account, the model shows that: (1) segment profit calculations can understate or over- 
state the value added by the segment depending on the segment's relative contribution 
margin, and (2) the firm sometimes benefits from devoting resources to less profitable 
segments and perhaps even from serving seemingly unprofitable markets and/or cus- 
tomers. 


Keywords: input pricing; performance measurement; segment profit; supply chains. 


I. INTRODUCTION 

ccountants have been persistent in attempts to generate disaggregate data that provides 

meaningful measures of segment profitability at both the market and customer levels. 

While joint costs and resource utilization have Jong been sticking points in such endeav- 
ors, recent developments stemming from activity-based costing (ABC) and customer lifetime 
value (CLV) analysis have been highly touted as means of better determining which markets and 
customers to service and the level of resources to devote to them. As a result, reliance on such 
measures has grown substantially in recent years. 
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While the desire to finely parse performance is understandable, history is rife with examples 
in which refined performance measures failed to fully reflect opportunity costs of decisions.’ In the 
context of market- and customer-level profitability, such opportunity costs have been recognized in 
terms of demand-side interdependencies. As examples, the benefits of having a loss-leader product 
for complementary demand-side effects on other products (e.g., “freebie marketing") or incurring 
sustained losses for competitive posturing (e.g., dumping or predatory pricing) have been well 
recognized. As such, practitioners and academics alike are acutely aware of demand-side consid- 
erations when making resource allocation choices. 

In this study, we demonstrate that, in addition to the well-documented demand-side spillovers, 
supply-side spillovers can also be present, necessitating adjustments to segment profits in order to 
evaluate the value added by a segment. In particular, even if market and customer profitability 
measures closely track individual segment profitability, they do not fully reflect the cross- 
subsidization that can arise when a multi-segment firm relies on a supplier for key inputs. The 
cross-subsidization stems from the fact that supplier pricing is influenced by the extent and prof- 
itability of the buying firm’s overall downstream reach. 

To elaborate, the input price secured by a conglomerate depends on the conditions faced by 
the portfolio of markets in which it operates. As a result, altering the portfolio of markets that a 
firm serves alters the supply environment as well. A large retail conglomerate that has expanded 
beyond the “low-hanging fruit” of its prime markets to markets with more modest profitability can 
secure lower input prices. Conventional wisdom suggests such lower prices arise due to the buyer 
power that greater size affords. The results here suggest that there is more to buyer power than 
mere size; when a firm procures inputs for multiple uses, input prices are affected by average 
market conditions, and thus expansion to less profitable markets has the concomitant silver lining 
of lowering the prevailing input price.” 

The basic model is of a firm that relies on a supplier for an input of material or labor that is 
used across multiple markets reflecting different segments or customers. The markets can differ in 
both the extent of retail demand and the degree of competition. With a presence in multiple 
markets, the seller is charged a wholesale price for the inputs it procures. Because the price does 
not dictate how or where the item has to be used, the firm is able to effectively prevent the input 
supplier from price discriminating based on usage. As such, when evaluating a segment’s worth to 
the firm, the critical issue is not only an individual market’s profitability, but also that market’s 
effect on the firm’s purchase sensitivity to input pricing. 

Having identified this supply-side consideration, we next examine the circumstances under 
which such forces arise. In particular, given that the segments differ in both competition and retail 
demand, we identify the underlying drivers of cross-segment complementarities due to their effect 
on input price sensitivity. Interestingly, determining a segment’s net effect on the firm’s input 
price, and hence the direction of its supply-side effect, amounts to a concise comparison of a 
competition-weighted contribution margin for that segment to the competition-weighted average 
contribution margin of the other segments. 

In this realm, practitioners routinely criticize firms for empire building and apparent corporate 
socialism, in that a firm that is highly successful in one arena tends to expand into other arenas 


t Commonly discussed examples inclode the fact that accounting accruals occur in undiscounted terms and thus fail to 
reflect the cost of capital, product line costs do not reflect constrained capacity, and fixed cost allocations potentially 
induce a “death spiral” (see, for example, Zimmerman 2003). 

2 Consistent with this view, in recent times, major American auto makers have argued that their large dealer networks to 
rural and less-profitable metropolitan areas (as compared to the small, targeted dealership network of their foreign 
counterparts) make them deserving of lower prices from parts suppliers, more favorable labor contracts, and even 
cheaper capital (from government loans). 
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with less success. As an example, Тоусса was widely panned for its push of the Lexus brand in 
Japan where the luxury vehicle market is dominated by European brands (e.g., Chozick 2007). 
Even if this effort does not yield the demand-side benefits of boosted sales in its home market, it 
can nonetheless help provide downward pressure on input prices that also affect its core business 
of luxury car sales in the U.S. In a similar vein, the slow start in U.S. sales of the new Acura ZDX 
does not bode as poorly for the firm as one may think, since the ZDX relies on as many common 
parts as the more widely profitable Acura MDX and Honda Pilot models and may, thus, help keep 
input prices low on these more profitab-e models. 

Put differently, if the firm chooses to eliminate an underperforming segment and operate only 
its most successful product line, firm-w-de profit would decrease by more-than-indicated segment 
profit figures alone due to the fact that Ле low-profit segment plays a large role in restricting the 
supplier wholesale price markup. Because such wholesale price adjustments are off-equilibrium, 
they are not reflected in (on-equilibrium: profit calculations and, as such, represent a latent subsidy 
provided by less-profitable markets to raore-profitable markets. 

As we demonstrate, this force means that it can even be the case that expanding into a 
seemingly unprofitable market may be optimal for a profit-maximizing firm. That is, when ex- 
panding (1) yields modestly negative psofit in the new segment and (2) such losses are partly due 
to low margins of the new segment, it can be the case that expansion is worthwhile due to 
supply-side complementarities despite the negative segment profit that ensues. Our analysis pro- 
vides precise conditions under which this phenomenon can (and cannot) occur. The theme of the 
analysis is further explored by examinirg the consequences of rivals' reliance on the same supplier 
as the multi-segment firm, and considering the effects of price competition in the retail realm. We 
demonstrate that this basic theme is robust with respect to these variations, though subtle new 
considerations arise. 

The thrust of this study lies at the intersection of discussions about (1) measuring segment 
profitability and (2) coordinating self-irterested supply chain partners. In the realm of profitability 
measurement, activity-based costing (ABC) has created a substantial improvement in the ability to 
track profits at the market or customer level (e.g., Cooper and Kaplan 1988). Through improved 
cost allocations, these processes are viewed by both academics and practitioners as critical to 
properly evaluating product line choices, resource allocation across segments, and keep-or-drop 
decisions (e.g., Goebel et al. 1998; Searcy 2004; Sopariwala 2005). At the customer level, ABC 
and customer lifetime value (CLV) calculations are used jointly to evaluate which customers to 
serve and target (e.g., Hogan et al. 2002, Searcy 2004; Venkatesan and Kumar 2004; Kuchta and 
Troska 2007). While significant efforts have been made to incorporate effects of downstream 
competition and demand-side complenentarities in such calculations, our results point to strategic 
upstream supplier pricing as another issue worthy of consideration. 

Supply chain frictions have been scrutinized by researchers in many areas of management. A 
fundamental concern at the forefront o7 such studies is the inherent conflict of interest in pricing. 
While supply chain partners have manr common goals, individual interests can lead to inefficien- 
cies that harm all. Perhaps most commonly analyzed among supply chain inefficiencies is that 
arising from double-marginalization. The issue of double-marginalization was first analyzed by 
Spengler (1950), following which several studies have considered the extent of pricing-induced 
distortions in supply chains and the myriad of means to alleviate such distortions (for summaries, 
see Katz (1989) and Lariviere [2008]) 2 We examine whether and how supply chain coordination 
concerns due to double marginalization interact with decisions a seller makes with regard to the 


3 Research in this area has also examined circumstances where prices above cost (and double-marginalization) prove 
helpful. Explanations include strategic customer behavior (Su and Zhang 2008), the time-inconsistency problem and 


The Accounting Review May 2011 
American Accounting Association 


808 Arya and Mittendorf 


markets it serves. In doing so, we demonstrate a supply-chain-induced interdependency among 
product lines and customers in a conglomerate that is not reflected in standard segment profit 
calculations. 

Extant research has also stressed other forms of supply chain inefficiency, manifest in terms of 
moral hazard among supply chain participants (e.g., Baiman et al. 2000; Balachandran and 
Radhakrishnan 2005; Hwang et al. 2006; Saouma 2008), adverse selection due to private infor- 
mation of supply chain members (e.g., Baiman and Rajan 2002), as well as a combination of these 
frictions (e.g., Baiman et al. 2001). In this vein, we stress that while our analysis examines a key 
interdependency among segments due to supply chain frictions in the context of double- 
marginalization, the key insight is not from double-marginalization per se but instead in showing 
that nontrivial supply chain frictions can induce supply-side spillovers that are not fully appreci- 
ated in segment profit calculations. Consistent with this point, we note that similar results can be 
obtained if the source of supply chain friction is due to adverse selection instead of 
double-marginalization.* 

П. MODEL 

A downstream (retail) firm relies on an upstream (wholesale) supplier for a key input. In 
converting the input and providing final goods to consumers, the downstream provider can reach 
multiple markets, some of which are more profitable than others. In particular, the retailer serves 
п segments, п 2 2, and requires one unit of input per unit of output in each segment. In segment 
і, i = 1,...,n, the retailer faces competition from a (Cournot) competitor, denoted rival i. For now, 
presume rivals do not rely on the inputs provided by the supplier. Retail demand in segment i for 
the firm (rival) is reflected by the inverse demand curve рр=а;—4— уй (Pj ai- ĝi- уд), 
where р; (pj) is the firm's (rival’s) retail price for each unit, and q; (4,) is the firm's (rival's) 
number of units sold. In the demand curve, а, a; > 0, captures the degree of retail demand, and y,, 
0 = у= 1 captures the degree of competitive intensity. 

Given these retail market conditions, the supplier determines the wholesale price for its 
inputs, w. For simplicity, we normalize the supplier’s production cost and the firm’s and rivals’ 
other costs to zero. Upon observing the prevailing wholesale price, the firm and its rivals (con- 
currently) choose the quantity of output to provide to their respective retail markets. Denote the 
supplier’s chosen wholesale price by w” and the competitive retail outcome by q” = (47); and 
4"”-14 | (in each case, the superscript n denotes the equilibrium when the firm participates in n 
markets). Given this outcome, segment i profit, i = 1,...л, is пау, d; М") | аг з 4; — yd; la; 
— w"qi, and firm profit is П”(4” , 4”, и") = > mq; , d; и”). 

Given this setup, our interest is in determining the value added by a segment. Without loss of 
generality, consider the value of segment n. Using the above notation, in equilibrium, segment n's 
realized accounting profit is т,(4", 4", и") = п’. The value added by segment n is firm profit 
with segment n less firm profit that would be realized in the absence of segment n. That is, the 
value added by segment n, denoted У,, is У, = II"(q" , d^ , w") - II"71(g^7! 4", и"). The ques- 
tion we address in subsequent analyses is if and how accounting segment profit, пр differs from 
segment value added, У,. 

Ш. RESULTS 
Equilibrium Supply Contracts 

Before comparing the accounting measure of segment value with the underlying value added 

by a segment, we first derive the equilibrium outcome for operations in a given set of segments. 





durable goods (Arya and Mittendorf 2006), and the desire to undercut de facto alliances between a supplier and a rival 


(Arya et al. 2008). 
A previous version of this study provided details of such an analysis, 

3 Throughout the analysis, we presume the parameters are such that there is nontrivial participation in each segment in 
equilibrium. 
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Then, comparing outcomes under л versus n —1 segments can provide a full characterization of the 
effects of segment n on firm profit. Working backward in the game, given the quoted wholesale 
price, у», and each rival’s presumed retail quantity, g;, the firm chooses retail quantities to maxi- 
mize its total profit as in (1): 


n 


мас Ха qi- уд - w]ai- | (1) 
4-5 ed 


Similarly, each rival chooses 4, to maximize its retail profit given 4 ee 3 SN 
5 Е fe 
Маха - ĝi- уда. 
й 
Jointly solving the first-order conditions of (1) and (2) yields: 


aj2- y,|-2 2- 
ajo) А and ip) = Ec t me 





Intuitively, the higher the firm's marginal cost (w), the lower is its quantity and the greater is its 
rival's quantity. This feature gives rise to the supplier's induced demand, which governs the 
supplier's problem. In particular, the supplier chooses a wholesale price that maximizes its profit 
realized from the firm; this profit is derived from all л retail segments of the firm, as follows: 


ма, мди) © мах xd : (3) 


il w il 


The firs m : п a[2—y]-4w = А а 
е first-order condition of (3) is За по я =0. Denoting Ка з and q= 23 and the 
corresponding average values by # and а" (16, "= [1/n]Zzk, and 60"-11/л цар, the 
first-order condition yields w^ = а” / [467]. Intuitively, the supplier’s chosen wholesale price is 
affected by two key factors: the level of retail demand and the degree of competition. The greater 
the retail demand (the greater a), the greater is the firm's willingness to pay and, thus, the greater 
is the wholesale price. The greater the competitive intensity (the greater k;), the greater is the threat 
that the rival will poach retail territory (and, hence, the supplier’s wholesale territory) in response 
to a higher wholesale price. As a result, more competitive intensity translates into a lower chosen 
wholesale price. When this thinking is expanded to a firm serving n distinct retail markets, the 
supplier’s price is similarly affected, but now the wholesale price reflects the cross-segment 
averages of each key factor (a^ and Ё). 

Using this equilibrium wholesale price in the firm's and its rivals’ quantity choices reveals 
equilibrium retail quantities, g,(w") and ди"). Lemma 1 summarizes the outcome of this exer- 
cise. (All proofs are in the Appendix.) 


Lemma 1: When the firm operates in n segments, the equilibrium outcome entails: 


O 4-424-ЁФ| 4 = а+ ара, and 


Gi) w'- 4. 

ар 
The lemma provides a crisp presentation of the above intuition. In Lemma 1(1), as empha- 
sized above, the supplier’s wholesale price reflects a weighing of the retail demand and retail 


competition effects on wholesale demand. In each case, the relevant parameter reflects an average 
of those parameters that underlie each segment. The resulting procurement quantities reflect a 
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similar averaging. The quantity chosen by the firm in a given segment is increasing in that 
segment's demand (а; relative to the average demand of its segments (87). This latter effect 
means that the average demand of all segments, by boosting the wholesale price, serves to mod- 
erate the quantity choice of the firm in any given segment. 

Similarly, the greater the competitive intensity a firm faces in any given segment (Kj) relative 


to its average intensity (k"), the lower is its quantity due to its rival grabbing market share. This 
consideration also has reverberations to the rivals' choices in that the firm's average market 
characteristics, by affecting wholesale prices and, in turn, the firm's retail choices, alter rival 
quantities. Thus, in terms of both a firm's competitive choices and its rivals’ choices, a cross- 
segment interdependency arises solely due to upstream pricing considerations. In the case of the 
rival, this interdependency arises despite the fact that the rival does not rely on the supplier and 
participates in a single market. Given the equilibrium outcomes, we next consider if and how 
segment profit differs from segment value added. 


Segment Value Added versus Segment Profit 

Segment profit is the primary means through which a firm and outsiders view the contribution 
each segment makes to the firm's overall profit. When evaluating a segment's worth, it is often 
discussed that accounting profit measures can leave out positive or negative demand-side spill- 
overs across segments. In particular, one product line may be a “1088 leader" in that its segment 
profit is negative, yet it is worth maintaining due to its positive effect on the demand for other, 
more profitable, product lines. Inkjet printers represent a prominent example—printers are often 
8014 near or below cost, but create a captive customer for cartridges tbat are sold at a substantial 
profit. In that case, looking at poor margins on printers reflected in segment profit clearly does not 
represent the whole story. 

In a similar manner to such demand-side effects, this study seeks to identify the role of 
supply-side effects on segment value added. In both cases, accounting segment profit figures fairly 
reflect “what is," but may fail to answer “what if." Our measure of segment value added, on the 
other hand, is a measure of the incremental value of a segment, in that it compares current firm 
value with the value that would have been attained in the absence of that segment. As the 
following lemma demonstrates, the difference between segment profit and segment value added 
can be captured by supply-side and demand-side spillovers. 


Lemma 2: Segment n's value added, V,, can be expressed as: 


У = T, +5, + О where 
би = П7-1",47,и”) ын T""!(q",q",w"), and 


D, = Пам") Па gw). 

Аз Lemma 2 demonstrates, segment value added differs from segment profit due to supply- 
side spillovers (5„) and demand-side spillovers (D,,). In particular, 5, reflects the change in profit 
of the n-1 remaining segments that would be induced by the concomitant change in supplier 
pricing, holding constant quantities demanded. Similarly, D, reflects the change in profit of the 
п-1 remaining segments that would be induced by the concomitant change in retail quantities 
(demand), holding constant the wholesale price. While demand-side spillovers play a prominent 
role in the lexicon of business strategy as evidenced by the prominence in textbooks, academic 
research, and practitioner literature, the role of supply-side spillovers has not been demonstrated. 
The present analysis addresses this gap. 
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With this in mind, the most pertinent expression for the present analysis is $,, which can be 
written more simply as 5, = [w^7! ~ w"][ qt ...+q?_,]. Thus, the sign of 5, amounts to deter- 
mining whether segment n's presence serves to increase or decrease the prevailing wholesale 
price. If 5, > 0, then the wholesale price is reduced by operation in segment n; in such a case, 
segment n provides a latent supply-side subsidy to the other segments. 

Determination of the extent of both the supply-side and demand-side effects requires substi- 
tuting the equilibrium values 4", 2", 477, 4771, и", and и”! from Lemma 1 into II"7!(g , d , w). 
This exercise reveals: 


п-1 
TERRE 
4(п = 1) || в 
m-l п-1 
Ук | иза 
i=] i=l 


2-8 А (=_=) Bu Edd 
"o t4n-DEÀH || у | јр ka n- DEC k, 


Summing 5, and D, yields the following proposition: 

Proposition 1: Segment n’s profit understates the value it provides if it is weaker than 
average (i.e., E « 5) and overstates value if it is stronger than average (1.е., 
чо @ 
ru > >> 

Two implications follow from the 5, and D, expressions that underlie the proposition. First, 
the supply-side effect can either be positive or negative, depending on how the characteristics of 
the segment in question differ from that of other segments. Second, the demand-side effect is 
unequivocally negative. That is, this setting is one where the typical demand-side complementarity 
is not at play. Instead, if a complementarity is to arise, it is on the supply side. Importantly, this 
supply-side effect is of first-order importance, as evidenced by the fact that the condition that 
determines its sign is also the condition that determines the sign of the net effect, as confirmed in 
Proposition 1. 

Proposition 1 completes the comparison of value added and segment profit. To be more 
precise, it confirms that у, m= Sa р, МЕ - =], Х > 0. In other words, despite the 2n 
parameters that underlie the setup (а ... а, and А... К„), the nature of the latent interdependency 
introduced by a segment can be succinctly captured by four parameters (а, а”, Ку, and К). To get 
a better intuitive feel for the condition in the proposition, we next isolate cross-segment variability 
in demand and competition, each in turn. | 

First, say the degree of competition is the same among all retail markets—i.e., у = ... = Yp In 
this case, with k, =k", the condition under which segment profit differs from value added amounts 
to a comparison of demand in that segment to the average demand among the portfolio of seg- 
ments. In particular, segments with lower demand serve to indirectly subsidize those with higher 
demand due to their ability to moderate the prevailing wholesale price. This subsidy is captured by 
the segment’s value added, but it is not reflected in the segment’s profit number. 


Corollary 1: In the у=...=у, case, segments with demand that is below (above) the 
average demand have profits that understate (overstate) the value they provide. 

That is, if а, < 2", then тг, < Vp, and if a, > а", then m, > Vp. 
On the other hand, if demand is the same across segments—i.e., a, ... =а, the only cross- 
market differences are reflected in varying degrees of competition. Note that the condition in the 
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proposition depends on а, and a", which reflect competition-weighted measures of demand. So, 
despite having equal demand (intercepts), cross-market variability in the а; terms can remain due 
to differences in competition. In this case, the condition in the proposition simplifies to a com- 
parison of competition in segment n to the average competition level among the other segments. 
However, the average is not equally weighted; instead, the weights depend on the cross-segment 
variability in competition. This result is presented in Corollary 2. 


Corollary 2: In the ац, ... „а, case, segments with competition that is above (below) a 
weighted-average competition level have profits that understate (overstate) the 
value they provide. That is, if n> Xl y. then 7, < У, and if y, 
« х5], then м” > V,. 


The intuition for the results in Corollary 2 can be seen by examining the case of two seg- 
ments. From the corollary, in the two-segment case with ај = а, profit in segment 2 understates 
value added if and only if у, > у. In effect, with у, > у, segment 2 provides a supply-side 
subsidy that is not picked up by accounting profit calculations, because competitive pressures in 
segment 2 help drive down the wholesale price that governs activity in both segments. 

A natural follow-up question to the above analysis is how one could expect to see these 
parameters manifest in practice. While one may get a feel for the relative competitiveness of an 
industry or segment with ease, the notion of a higher or lower demand intercept is perhaps less 
natural. However, if one instead looks at profit margins, a related picture arises. In particular, 
denote the (unit) contribution margin of segment i by т, т=а-— q} — уду — м", and the 
n-segment average by m^, m" =[1/п]2 m; Given this notation, the next proposition confirms 
that the succinct comparison in Proposition ] also has an accounting analog. 


Proposition 2: Segment n's profit understates the value it provides if its profit margin is 
weaker than average (i.e., r3 < and overstates value if its profit margin is 
stronger than average (i.e., * > 5) 


In short, the proposition shows that a segment that faces greater competition and/or lower 
contribution margins is one that serves an important off-equilibrium role in boosting the fortunes 
of other segments. Besides demonstrating the underlying forces driving an inherent inter-linkage 
in segments when a firm relies on external input supply, the proposition may shed some light on 
the widely held view that in multi-segment firms, underperforming segments are propped up by 
overperforming segments (e.g., Shin and Stulz 1998; Rajan et al. 2000; Gertner et al. 2002). Since 
such findings are derived in part from accounting segment profit measures, they may not reflect the 
latent input price subsidy provided in the reverse direction—i.e., a subsidy to overperforming 
segments provided by an underperforming segment. 

Even more broadly, the proposition suggests that a firm with multiple segments that utilize 
common inputs would exhibit seemingly excessive (constrained) investment in underperforming 
(overperforming) segments, despite such behavior being value-maximizing. Viewed in this light, 
apparent inefficiencies in resource allocation may not be evidence of value destruction from 
diversification but rather reflect a form of selection bias in that multi-segment firms are inherently 
different from their single-segment counterparts (see, e.g., Chevalier 2004). Roughly stated, con- 
trasting a multi-segment firm with a linear disaggregation of its segments misses out on the fact 
that third-party relationships of presumptive, stand-alone segments can be strikingly different from 
those nurtured by the aggregate firm. Clearly, such shifts must be accounted for in order to 
calculate a meaningful segment diversification discount/premium. If this view were borne out 
empirically, then one would expect the most (least) profitable divisions of a multi-segment firm to 
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perform better (worse) than their single-segment counterparts. Further, this feature would be 

expected to be more pronounced the more critical input supply is to the multi-segment firm. 
Extending the theme, we next address if and how the latent supply-side subsidy provided by 

underperforming segments can alter the decision to enter a market in the first place. 


The Decision to Enter a Market 


The analysis thus far has demonstrated how accounting measures of segment performance can 
fail to capture the benefits of poorly performing segments due to their ability to put downward 
pressure on supplier pricing. Among other things, this view can possibly shed new light on 
seemingly widespread corporate socialism. When one thinks of criticisms of corporate socialism, 
wherein a firm uses one very successful line of business to seemingly support otherwise unsus- 
tainable segments, the notions of empire building and excessive market entry in response to 
success in a core market inevitably come to mind. To this end, we next consider the implications 
of the model for decisions to enter ancillary markets. 

Suppose the firm operates in markets 1,...,n-1 and is considering entry into market n. De- 
noting the cost of entry into market i by F;, the segment profit gleaned from entry into market л 
is simply 7, — Й». Suppose further that this proposed expansion fits the bill for empire building in 
п ш potential profit from the segment is below the average for the firm’s core segments, і.е., 
гс = In that case, from Proposition 1, entry into the new segment offers a latent benefit of 
ііі altering the prevailing wholesale price. In fact, this force, while latent, may form the 
basis for expansion into markets that yield negative profit. In other words, a firm may expand into 
new “loss-leader”’ markets, not as demand-side strategic posturing but instead as supply-side 
strategic posturing. The next proposition presents the conditions under which this phenomenon 
arises. 

Proposition 3: If Э << Е. then the segment profit from entry is negative while the value 

added ын entry is positive for ай Ер in the non-empty interval 
(F' ,F' +8,+0,), where Р' = а – ат]. 


In effect, the proposition notes that if a new segment is weaker than the existing portfolio of 
segments, then it is possible that entry therein yields negative segment profit but is nonetheless 
worthwhile. This occurs only for intermediate levels of entry costs. If the cost of entry is low 
(Е, < Е’), then the positive margins from entry are sufficient to cover fixed costs, and segment 
profit in the new segment is positive. At the other extreme, for high entry costs (F, > F' +S, 
+ D,) the entry cost is so large that even when the supply-side complementarity is considered, 
entry is not worth the cost. For an intermediate interval of entry costs (F, Є (F' , F' + 5, +D,)), 
the segment profit is negative but the supply-side subsidy makes entry worthwhile. Consistent with 
this, the larger the supply-side effect (5,), the larger the interval. 

Proposition 3 is couched in terms of implications of strategic input price adjustments on 
firms’ entry decisions. Of course, in an analogous fashion, such concerns also impact a firm’s 
decision to exit a market, where F; represents the ongoing (fixed) cost of operating a segment. The 
example in Table 1 makes the point that the failure to recognize that the wholesale price is variable 
(not fixed) with respect to the decision to keep or drop a segment can lead to a death spiral. In a 
sense, this is the "reverse" of the familiar cost-allocation death spiral in which allocation can 
create an impression that a cost is variable while it really is fixed (and sunk). Broadly stated, both 
the supply-chain death spiral and the cost-allocation death spiral stress the need to be cognizant of 
the fixed versus variable cost dichotomy. 

In Table 1, the first scenario characterizes the equilibrium outcomes when the firm operates in 
all three segments. In this case, the firm earns positive profit, but does so in the presence of a loss 
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TABLE 1 
A Supply Chain Death Spiral 


Example Parameters ( У, = 0) 


Segment 1 Segment 2 Segment 3 
Demand parameter (aj) 40 30 20 
Fixed cost (F;) 110 45 10 
Firm Operates in Segments 1, 2, and 3: (w = 15) 
Segment 1 Segment 2 Segment 3 Total 
Output level (4;) 12.50 7.50 2.50 22.50 


Profits (пі) 46.25 11.25 -3.75 53.75 


Firm Operates in Segments 1 and 2: (w = 17.5) 
Segment 1 Segment 2 Total 


Output level (9)) 11.25 6.25 17.50 
Profits (пі) 16.5625 -5.9375 10.625 


Scenario 3: Firm Operates in Segment 1: (w = 20) 





Segment 1 Total 
Output level (gj) 10 10 
Profits (пт) -10 -10 


in segment 3. The next scenario demonstrates the outcome of following a policy of exiting the 
unprofitable segment. In this case, the removal of segment 3 results in a lowering of firm-wide 
profit (from 53.75 to 10.625), in which case segment 2 arises as unprofitable. Eliminating this 
segment, in turn, leads to the final scenario, wherein only one segment remains and firm profit is 
negative. In other words, supplier pricing effects can introduce a death spiral of sorts, despite the 
fact that the fixed costs of operating in each segment are presumed to disappear along with the 
segment. Here, the death spiral arises because elimination of unprofitable segments serves to 
embolden the supplier in its pricing, thereby putting downward pressure on the margins of the 
more profitable segments. In effect, the firm’s decision of eliminating a segment is not made in 
isolation in that the supplier, too, reacts strategically in response to such a course. 


Rivals’ Reliance on the Supplier 

To this point, we have presumed a setting wherein a firm relies on the supplier for inputs but 
its rivals are self-sustaining. Suppose instead that the rivals, too, rely on the supplier for inputs. By 
providing a perspective on alternative supply-market conditions and the consequences for supply- 
side externalities from multi-market participation, this setting can serve as a robustness check on 
our underlying premise. In this case, the supplier sets п+1 wholesale prices, one for each firm 
(denoted w for the multi-segment firm and w; for rival i). As before, the firm and its rivals 
concurrently choose retail quantities subsequent to observation of the prevailing wholesale prices. 

Lemma 3 presents the equilibrium outcome under the setting with both the firm and its rivals 
reliant on the supplier, where the notation ['] distinguishes this equilibrium from that of 
Lemma 1. 
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Lemma 3: If the firm and the rivals each rely on the supplier, then the equilibrium outcome 
entails: 


г 1 а-а 
O а" 5-58) a= р and 





(1) "=>, "= а + | 

Comparing Lemmas 1 and 3, a key difference is apparent—whereas both the level of retail 
demand and the degree of retail competition played critical roles in the multi-segment firm’s 
wholesale price in the former case, the wholesale price is independent of the degree of competition 
in the latter. That is, when the supplier provides inputs both to the firm and its rival, it does not 
tailor the firm’s input price to the level of competition they face. To explain, when the supplier 
provides inputs only to the firm, it is that firm’s de facto partner, and, as such, is eager to cut 
wholesale price to give the firm an edge in competition, i.e., as у; increases, w” decreases. In 
contrast, when the rival is also the supplier’s customer, the supplier has no incentive to favor one 
customer over another. The end result is that w'" is independent of у. 

The desire to have a level playing field among competing firms extends to the wholesale price 
set for rival i, in that the rival’s price reflects the average retail market conditions of the firm, the 
extent of which is dictated by the level of competition, у;. The effect of cross-segment averages 
extends to the rivals despite the fact that each participates in only one market. This feature arises 
so that if the firm benefits from an underperforming segment by getting price cuts, some of those 
cuts are carried over to the rival so as to maintain competition and thus boost input purchases in 
that market. 

Given the induced wholesale prices and resulting competitive outcomes in the case of rival 
reliance on the supplier, the supply-side effect of participation in segment n is: 


пої 
л = Р: zir- "Er Ч 


whereas the demand-side effect is: 





пла du 1) 


Given these expressions, the next proposition demonstrates the net effect of the firm's multi- 
market participation. 


| 


Proposition 4: With rival reliance on the supplier, segment n’s profit understates the value it 
provides if its demand is weaker than average (1.е., а, < а") and overstates 
value if its demand is stronger than average (1.е., а, > а”). 


Comparing the expressions that underlie Proposition 4 and Proposition 1, it is readily seen 
that the rivals’ reliance on the supplier simplifies the expression for both the demand- and supply- 
side effects. In particular, no longer is the degree of competition in any of the markets critical, but 
instead the distinguishing factor is the comparison of retail demand in one segment to the average 
retail demand levels. This factor carries forward to the determination of whether profit over- or 
understates segment value added. In particular, the difference between segment value added (У,) 
and segment profit (п, ) can be expressed as У, — п, =S} + D! = АП" - an}, М 0. This simpli- 
Нед expression means that a concise comparison of the demand level in segment п to the equally 
weighted average of that in other segments determines the sign of the difference between value 
added and profit. 
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Thus, besides demonstrating that the underlying premise herein is robust to who relies on the 
supplier for inputs, the setting also provides some guidance in terms of how the forces should arise 
in practice. That is, while retail demand in a segment relative to average demand is consistently a 
key determinant of any latent supply-side subsidies across segments, the degree of retail compe- 
tition is crucially tied to the extent that the retail parties and the wholesale supplier form an 
alliance. 

While the analysis so far has considered the cases wherein either the rivals are not reliant on 
a supplier (as in the base model) or reliant on a common supplier (as in this section), there are also 
supply arrangements in which each retail party relies on its own dedicated supplier. The following 
example contrasts the common supplier and the dedicated supplier settings, and reinforces the 
notion that supply-side effects can manifest themselves in a myriad of input procurement relation- 
ships. 

Suppose there are two segments with a; а». Moreover, for simplicity, say y; - 0 and у, 
> 0, ће. the firm enjoys a monopoly in the first segment, while it faces a rival, rival 2, in the 
second segment. If the firm and the rival rely on a common supplier, then, from Lemma 3, the 
wholesale price is a/2 for each buyer. This is the same price that the supplier would have charged 
the firm had it operated only in segment 1. Thus, with a common supplier and a, = аз, there are по 
supply-side effects and segment profit equals segment value added. 


However, suppose the firm and the rival rely on separate (dedicated) suppliers. In this case, 
r : a[2- 1328] а[2—у, 1168454] 
the wholesale prices for the firm and the rival are “ГД | 4847-35-58 06 
Notice, the firm’s input price is less than а/2, the price the firm would have been charged by its 
dedicated supplier had it operated only in segment 1. Thus, with dedicated suppliers, there is again 
a supply-side subsidy. Also, note that the wholesale prices for both parties are decreasing in ү. In 
effect, as segment 2 becomes more competitive, the suppliers respond by lowering the input 
charge to their respective clients. In fact, this reduced input-pricing effect can be so pronounced 
that both the firm and the rival may welcome the increased competition. In the example, the firm’s 
profits are increasing in у, for у, > 0.62; the same is true for rival 2 for у, > 0.68. The role of 
multi-segment operations is critical in deriving this result. After all, if the firm and the rival each 


, respectively. 


operated only in segment 2, then each faces a wholesale price of A This price is again 
decreasing in у, but each party would be made worse off by added competition. In contrast, the 
multi-segment firm gains because the increased competition in segment 2 lowers the input price 
that applies to purchases it makes in both segments, including the segment where it enjoys a 
monopoly and, thus, high profit margins. And, relatedly, when competition is sufficiently intense, 
rival 2 gains from the firm's ceding of market share in segment 2 in order to focus on its more 
profitable segment 1. Not surprisingly, then, similar results on retail parties benefiting from in- 
creased competition also apply when the multi-segment firm is the only party that relies on the 
supplier (as in the base model). 


Price Competition 

Thus far, we have considered the supply-side ramifications of a firm's participation in multiple 
output (retail) markets in a world of Cournot (quantity) competition. As the nature of competition 
has often been shown to lead to reversals of results (see, e.g., Darrough 1993; Dye 2001, 
Verrecchia 2001), we next consider the extent to which the key implications herein are sensitive to 
such variation by revisiting the analysis under Bertrand (price) competition. To reflect that the 
competitive outcome now arises under price competition, we denote all equilibrium outcomes in 


this case with a [^]. Following this notation, then, let і = я and ё; = SEXIES and denote 
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the corresponding average values by K^ and 47, respectively. Relegating the details to the Appen- 
dix, Lemma 4 presents the equilibrium outcomes in the case of price competition. 


Lemma 4: Under retail price competition, the equilibrium outcome entails: 
5 АП 1 А Ё x, А 7 Ё А) 

(і) фіз 1[2д- ia, @ = & + тра" апа 

(i) Wr = 47 [24]. | 


Using these outcomes, ће price-competition analogs to the supply-side and demand-side 
effects from the main model are: 


А x, А n-i 
СЧ (ae НЕ, (т E 


26-18 РО * 


and 
n-i 
> 1- 706 
К р an 2 an 5 - ТУЧ п-1 2 ал 
3,--|--8--1|9.91(9 фан us xe | 
am- pile AJ e 6) a@—nee / = 6-0 


respectively. Given these expressions, the analog to Proposition 1 is as follows: 
Proposition 5: Under price competition, segment n’s profit understates the value it provides 
if < © and overstates value if 2 > ©. 
h р hob 

In brief, comparing Propositions 1 and 5 confirms that the nature of competition (quantity 
versus price) does not alter the basic conclusions of the analysis. In fact, the basic form of both the 
supply- and demand-side effects of participation in segment n that are not accounted for in 
accounting segment metrics is invariant to the nature of competition. What changes instead is the 
precise calculation of the key parameters that capture retail demand (а; versus 8,) and retail 
competition levels (k; versus Ё). 

Taken together, the results throughout Section III suggest that (1) the importance and role of 
cross-segment supply-side subsidies induced by wholesale price effects is invariant to the nature 
of retail competition and the portfolio of supplier clients; and (2) the direction and magnitude of 
such supply-side effects can be critically affected by key characteristics of the retail markets (the 
degree of competition and extent of demand among the firm's portfolio of segments). 


IV. CONCLUSION 

Determining the contribution of various segments to the overall profit of a diversified entity is 
a delicate exercise. Refined costing measures have substantially improved this process and, hence, 
led to additional reliance on segment profit measures for resource allocation choices. Yet, because 
accounting profit metrics neglect opportunity costs, segment profit figures can deviate from the 
value added by a segment. The common concern in this vein is of downstream complementarities 
brought by loss-leader products or locations as means of furthering brand recognition and/or 
product loyalty. Our study examines a divergence between segment profit and segment value 
added that arises instead from upstream considerations. 

We demonstrate that, even if product demand and long-term competitive posturing are not 
pressing considerations, a firm can nonetheless benefit from its weaker retail performers so as to 
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support more favorable supplier pricing terms. Since less profitable product lines or customer 
groups solidify a reluctance to accept markups in supplier pricing, a firm’s active participation in 
less profitable markets can help support lower input prices. This effect, in turn, points to an 
additional consideration in keep-or-drop decisions. 

While this fundamental phenomenon is demonstrated in a model wherein industrial structure 
is taken as given, further research could seek to endogenize the structure of industries and, in 
particular, the varied product offerings that a conglomerate would optimally seek in light of 
input-pricing considerations. Future study could also shed light on other forms of perceived 
“weakness” that could prove useful in supply chain relationships, such as public disclosure of 
proprietary information or even limited acquisition of relevant information. 


APPENDIX 
Proof of Lemma 1 
Working backward, given wholesale price w and rival 75 output ĝ;, the firm chooses quantity 
4, in segment i, i = 1,...,n, to maximize its total profit: 


п 


Мах > [a - 4: - Yd; "Че. (A1) 
дама Та Пра] 


Given q; rival i chooses ду to maximize its profit: 
Мома - 4, - 7412, i=1,...,n. (A2) 
а 
Jointly solving the first-order conditions of (А1) and (А2) yields: 


aj2 - yj] - 2w 
mo ш ~ 2wk; and 
ae) = ЕЕ Р” „ш ak, an 


2- + ум 
Е а iz 1, ...,n. (A3) 
4-7 
The output reaction, g,(w) in (АЗ), serves as the induced demand function for the supplier, and it 
solves: 


п п 
Max, мау) €» Max>, w[a; - 2wk;]. (A4) 

W tel и 
The first-order condition of (A4) yields w” in part (ii). Using (АЗ), the outputs listed in part (i) are 
qi(w") and gw"). This completes the proof of Lemma 1. "n 


Proof of Lemma 2 | 
The value added by segment п is Vp, V, =II"(q",q", мм") — ПО! ,q"-! , м!) Segment 
п'я realized (accounting) profit is mh, т, = ям » d, , м"). Hence, 


V, “р т, = Il'(q^, d", w") rg gw) = па (ds, In W^). (А5) 
Since пт d , w") =П" 4" d , w") + Tal Qh , d, , w"), (A5) can be written as: 
V, — m, = I7! (q^;d^,w^) - gg" ws"). (A6) 


Adding and subtracting II"7! (q^ , 9", эу"), (Аб) can be written as: 
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= g" g^ w^) – Па," 


+ Hg" ign w^) - Ig gw). (A7) 
The first term on the right-hand-side of (A7) is 5, and the second term is D,. This completes the 
proof of Lemma 2. | 


Proof of Proposition 1 
Because II"!(q, 4, w) = Eri mq, d; w) and mhg, G, w) =[а,-ч,- У – wl, the 5, 
and D, expressions in Lemma 2 can be equivalently written as: 


n-l 
stel «| апа 


1 


п-1 


р,= У, Паи – а! - gh 14-42 1-а 4. (A8) 
i-i 


From Lemma 1, 4! = 20; 587], a! = $[20,- E al wa =, wia, Substituting 
these in (A8), and making use of the identities а = 1127-1 + 10, and ји = ELR! + 16, algebra 


yields the expressions for 5, and D, provided іп ће text. 
Using S, and Ри: 


Жан where 
рою 


п-1 

24 

Sb S iara Edl- (52 = = 
4(n—1)k™! ві i=l ke (n - је“: 


The sign of V, – тг, is the same as the sign of ү - pru Note that À > 0 if and only if the 


n-i леї бус? Zir 
term T= - Ik ydg? МО E T a ‚(= |>. Because q? > 0, it follows that if 


OnE hx , Т>О. Tins the 2 is complete if it can be shown that Т > 0 even when а, 
5 oe (from 472» 0 in Lemma 1), 1/454,51/3 (with the limiting 
“ах corresponding to у= О and y,=1, respectively), a lower bound on 27142 and an upper 


Z a „| Уан J is derived below: 


(A9) 


T 
E 
d 














р (л-1)! 











м one ка” _ па" а" a па 4,027 па [V3]x 

Ха чо з а паса зро Цин eae саз 
ыла 26 (A10) 
(2 3^ 
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28 
є у | zd 2 Е ЫГ (рэ d. а ИЗД 
р (n- Пе! ро ор )/М14|л-1| ој 9[1/4] 
8a" 
ET (A11) 
From (A10) and (A11), and n 2 2, 
п-1 
= 2a" _ ыг 
T» Хун ES > Xem (A12) 
i=l 
Thus, T > 0 and, hence, Х > 0. This completes the proof of Proposition 1. ш 
Proof of Corollary 1 
In the у; = ... =, case, denote у, by у. Thus, > x = ad[2 – у] and = z-a[2- y]. Using these 
in (A9), V, — т, -X2- yl a, —a,], X > 0 and 0<у<1. This ае the proof. | 
Proof of Corollary 2 
In the a, = ... за, case, denote а; by a. Thus, + =42- У | and 5 сэн о ret Using these 
in (A9), У, - т" a ae xnl ју), х> 0 Ён a > 0. This molles the proof. и 
Proof of Proposition 2 
28: the firm operates in n-segments, its (per-unit) contribution margin in segment i is т, 
ша,- yid, ~ W^. Using з and wholesale price in Lemma 1, and substituting а; by 
42201 and y? by 4—1 / k, m; is 
т=а- i-e, (A13) 
2k” 
From (A13), m" = УУХ m= = 5. Thus, 
т" g а" 
m aeo i о (A14) 
poop ka ka 28 
From (A14), =~ ро — №. Thus, from (A9), У, - m, = МЕ ze], > 0. This completes the 
proof of Propositio п 2. : a 
Proof of Proposition 3 
Using Lemma 1 and (A13), segment n’s profit from entry is: 
kn 2 
m —F,=m,qn-F,=| о, - а" | -Е,=Е’-Е,. (A15) 
. 2k" 


From (А15), Е, > F' ensures 7, - Е, < 0. The value added by entry equals: 
V,—F,= т +5, + Dp Е, = F' + Sp t Dp Е. (A16) 


From (А16), the firm prefers to enter if F, < F' + S, + D,. Thus, for РЕ, Є (F' , F' + 5, + D,), the 
segment profit from entry is negative while the value added by entry is positive. Finally, from 
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Lemma 2 and Proposition 1, the assumed condition a,/k,< а"! К" implies У,-т, 
= 5,+ D, > 0. Thus, the interval (F’, F’+5S,+D,) is non-empty. This completes the proof of 
Proposition 3. и 


Proof of Lemma 3 
The firm's problem is again as in (A1). Given wholesale price ў; and firm output q;, rival і 
chooses 4, to maximize its profit: 


Мама - - Yidi W114), i=l, wee Mt. (А17) 
4 
Jointly solving the first-order conditions of (А1) and (A17) yields: 


( 2) а|2- 7 – Фу + умі 4 
W,W)) = = a 
di 4-4 


mp my 42-7] - 20, ум 
=, i= 1.,...,n. А18 
а й) па # ( ) 
The output reactions in (A18) serve as the induced demand function for the supplier, and it solves: 


n 


Мах У, [аки w) + wig w, wj]. (А19) 


МЕ] pal 


Solving the first-order conditions of (A19) yields w'" and #;" in part (ii). Using these wholesale 
prices in (A18), the output decisions in part (i) equal 207" у") and g(w’", w;"). This com- 





. pletes the proof of Lemma 3. a 
Proof of Proposition 4 
From Lemma 1, 4" =i тен zaja 07—12. Цав й 251» ту г, and и! = го Sub- 


stituting these expressions a the analog ie (A8), and making use of the identity а" = slg- | 
+ аһ algebra yields the expressions for 5, and D}. 
Using 5, and D;: 


V,- па -A'[a" — ал], where 


Ї 
кв Ши а" а -=| (А20) 


Using q;" from Lemma 3(i), the À' expression in (А20) can be equivalently written as follows: 


[(n-1z"-(nz"-a,)] 
vais] а пе о 


1 n-l 
Tas] n] А 


If a, > a^, from (А20), A’ > 0. And, if а, < а", from (А21), again \’ > 0. With A' > 0, from 
(A20), the sign of Vi — т is the same as the sign of а"- а,. This completes the proof of 
Proposition 4. E 
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Proof of Lemma 4 


Solving різар- 4;- уд; and 8Б,-4:-4:-Үйь the demand functions are акр;, P) 
а1-у|рњ+уР: с1-у0-Рг-үр 
B- be and os ,Бде TES re wholesale price w and rival price Б, in the 


Bertrand game, the firm chooses р; in segment i, i = 1,...,n, to maximize its total profit: 


Мах > [pi- жјаќрьР). (A22) 


різі, „Пр 


Given retail price p,, rival i chooses р; to maximize its profit: 


Махр (рљр), 1-1, Л. (А23) 
Рі 


Jointly solving the first-order conditions of (А22) апа (А23) yields: 


руд « SD +2 Т 
i = 4- № , 


а[2- y- у] + yw 


Pw) = 4- № , 


Using retail prices from (A24) in the demand functions q,(p;, p;) and ĝ;(p; , Бр yields: 


21,...л. (А24) 


а{2- у - У1-Ї2- Дю 


= й; — wk, and 
4-5узу цан, 


4”) = 


2 (уус Mea Ht ж умій 
| 4-5y + у oar 


(A25) serves as the induced demand function for the supplier, and it solves: 


із1,... п. (А25) 


Max, wG,(w) > Мах У) м й; – wk;]. (A26) 
Ww fal и il 


Solving the first-order condition of (A26) yields м" in part (ii). Using this wholesale price in 
(A25), the equilibrium outputs, noted in part (i), equal 4.77) and 4,(97). This completes the proof 
of Lemma 4. Ш 
Proof of Proposition 5 

From Lemma 4, й'з Цэа,- ial, gris РА i| w=, and wre 
а: these in the analog to (АЗ), and making use of the identities à" = 2-4"! + та, and 
ki? = nl -1 + Ik, algebra yields the expressions for $, and Dy. 
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у, d m —-— |, where 
p 
n-l 
: : > (1– 706 
| Эс £a < ! 
Ка ш ыш 150-00 XE = pee eT | (A27) 
полу || 7 p 2(n - 1) 


Tbe sign of V, ~ ® is the same as the sign of © — if > 0. Note that > 0 if and only if the 


tx y. Man 
term 7 = su. кл Зуд 


р 2(я-1)Ё”! 
Because gj > 0, it follows that if а, > эн Т > 0. Thus the proof is complete if it can be 
5 Ё 2 
Lae £ -y 


shown that Тэ» 0 even when â, < 77. Given 8, < 5 and 1/3 = 25 < 1 / 2 (with the limiting 
їх 


values corresponding to у 0 and у, = 1), a lower bound оп Т is derived below: 











n-l 
А Неба ^x > [2 – ya - YD; yt! £d 
ue ж Re Tea (AE 
E р 2(n - 1)! Ф Ё 
п-1 
Dk Y 2 22 " 
ы 20 [ёё | afm . Е) ifi. 
A a аш е күү кк. 
A(n— 1)! = р 2" Р 
a -1 Худ 
1 т" 115 1{ ka 
= 5010" - à th нні Gy -134-4 5 - 6, |. (A28) 
х і ж 
р k 
From (A28), for б, З = Т»0 and, hence, Х > 0. This completes the proof of 
Proposition 5. | 
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ABSTRACT: We examine detection and severity classification of Intemal control defi- 
ciencles (ICD) Identified under Section 404 of the Sarbanes-Oxley Act of 2002. While 
the cost/benefit balance of auditor testing of тета! controls is highly controversial, 
prior research has not examined auditor versus cllent detection of ICD, nor has it 
examined factors auditors consider in judging ICD severity. We find that auditors detect 
about three-fourths of unremediated ICD, usually though control testing. This finding 
contrasts with extant research inferring control deficiency detection effectiveness from 
publicly avallable data, underscoring the value of Section 404 auditor testing in improv- 
ing financial reporting quallty. Auditors Judge greater severity when a misstatement has 
already occurred. In the absence of a misstatement, severity is contingent on client and 
ICD characteristics, implying а more complex and nuanced judgment process without 
objective evidence of control failure. We also find that clients often underestimate ICD 
severity, but this tendency is lower among well-controlled companies with a well- 
designed Section 404 process. 


Keywords: internal controls; Sarbanes-Oxley Section 404; risk assessment; 
materiality. 


І. INTRODUCTION 
Te study investigates detection and severity classification of internal control deficiencies 
(ICD) under Section 404 of the Sarbanes-Oxley Act (SOX; U.S. Congress 2002). The 
purpose of Section 404 is to improve the reliability of information provided by public 
companies to the financial markets (COSO 2006; PCAOB 2004) by requiring company manage- 
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ment and auditors to test internal control over financial reporing (ICFR) and to disclose severe 
control flaws that are not remediated as of the balance sheet date. Disclosure of material weak- 
ness(es) (MW) involves both detection and classification (Leone 2007), but in order to isolate 
these processes for study, information is needed on the base of ICD from which MW are derived. 

Using proprietary data on detected ICD provided by several large audit firms, we address two 
research questions. First, what is the relative contribution of clients and auditors to ICD detection? 
This study is unique in its ability to examine, within the same time period, the extent of client 
(Section 404(а)) versus auditor (Section 404(b)) detection of ICD when both parties document and 
test controls. Within this analysis we examine factors contributing to client and auditor detection. 
One key feature of our study is that we provide direct evidence on the "yield" of detection 
methods used by auditors. This is an important issue at the heart of the debate on the value of 
auditor involvement in assessing and testing internal controls. Non-accelerated filers have recently 
been exempted by Congress from auditor attestation of controls effectiveness (Section 404(b)) as 
part of the financial reform legislation (U.S. Congress 2010). Emerging research (Kinney and 
Shepardson 2010; hereafter, KS) addresses the impact of auditar testing of controls using publicly 
available data. KS infer that because Audit Analytics (AA) data show that Section 404 reports for 
many companies with MW mention "material and/or numerous year-end adjustments," substantive 
testing might be sufficient to detect at least one MW for suzh companies, resulting in public 
reporting of ineffective controls. However, due to limitations of publicly available data, KS are not 
able to track misstatements to individual MW and, thus, their conclusion relates to company-level 
reporting of effective or ineffective controls. In contrast, we analyze deficiencies at the level of the 
individual control, directly examining detection method based on reports by engagement teams (as 
opposed to inferring possible detection method from annual public reports). Further, our data 
include not only MW, but also cther detected ICD judged by auditors to be less severe. 

Second, what factors are associated with the auditor's severity classifications of detected 
ICD? These classifications determine which contro! flaws are publicly reported (MW), and which 
are communicated to management and the audit committee (significant deficiency/deficiencies; 
SD).! In AS No. 2 (PCAOB 2004), SD are those deficiencies with a “more than remote" likelihood 
of failing to prevent/detect misstatements of “more than inconsequential” amount.” For MW, the 
amount of potential misstatement must be “material.” These criteria for judging misstatement 
likelihood and materiality are likely complex to apply in practice due to difficulty in interpreting 
probability phrases, as recognized by prior research (Amer et al. 1994; Asare and Wright 2008). In 
addition, the wording of classification criteria is ambiguous, in implying that a deficiency can be 
“significant” (generally defined as important or consequential), but not be “material” (capable of 
influencing the judgments of finencial report users). We study auditor judgments in the complex, 
unstructured task of severity classification, in this context of imprecise definitions and guidance. 

We investigate these issues in a sample of audits of smaller accelerated filer companies, in 
which 3,990 ICD were detected and 2,942 remained unremediated at year-end.” Regarding our 





Both classifications are consequential. Prior research shows that MW disclosure under Sections 404 or 302 is associated 
with adverse consequences to companies (e.g., Krishnan et al. 2008; Hamme:sley et al. 2008), and to company officers 
(Li et al. 2010). Because SD are not krown unless voluntarily disclosed, research has not studied adverse consequences 
to company officers from SD detection. However, SD are also important because they may aggregate to a MW, or 
evolve into MW if not remediated in a timely manner (AS No. 2, 1140). 

AS No. 2, effective during our study period, was replaced by AS No. 5 (PCAOB 2007), which also requires that 
manapement's assertion and the auditor's opinion оп [СЕВ effectiveness are supported by testing of controls. At 
inception in 2004, Section 404 applied to “accelerated filers,” i.e., companies with $75 million of common equity float, 
among other criteria. Also during our study period, the SEC suggested that company managements look to PCAOB 
guidance for ICD classification. Specific SEC guidance for management was issued after our study period. 

3 Severity classifications can only be studied for unremediated ICD. In order to examine detection and classification in the 
same sample, we restrict the detection analysis to unremediated ICD. 
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first research question, we find that clients detect fewer ICD than auditors, and are less likely to 
detect severe and pervasive ICD. Thus, many control flaws most likely to affect financial reporting 
would not be found in a client-driven process such as Section 302. Our results also show that use 
of a large accounting firm consultant for Section 404(a) work is associated with improved client 
detection. We extend this analysis by investigating how auditors detect ICD. We find that control 
tests provide initial evidence on a large proportion of ICD, including most MW and entity-level 
problems viewed as more serious by financial report users (Hammersley et al. 2008). This finding 
affirms auditors’ Section 404 control testing as an important source of detecting control deficien- 
cies, and implies that the recent exemption of Section 404(b) for smaller U.S. public companies 
could result in failure to fully realize potential improvements in financial reporting quality in that 
sector of the market. 

Our second research question considers ICD severity classification. We first investigate the 
influence of a preliminary client classification (Earley et al. 2008). We find that clients tend to 
classify ICD £s 1688 severe, but auditors frequently override those classifications. Modeling ICD 
severity classifications, we find higher severity associated with: (1) greater knowledge and inde- 
pendence in the client's Section 404(a) process; (2) more objective evidence (e.g., an existing 
misstatement); (3) control flaws other than documentation problems (e.g., inappropriate design); 
(4) certain types of entity-level ICD (e.g., Control Environment); and (5) certain types of account- 
specific ICD (revenue and tax), consistent with the regulatory climate of the period. 

Because prior research underscores the importance of objective evidence when standards are 
imprecise (e.g., Nelson et al. 2002), we further explore the role of an associated misstatements in 
ICD classification. An associated misstatement provides objective evidence that an ICD may fail 
again in the near future, but how do auditors judge severity when a misstatement has not already 
occurred? In this situation, we find that an independent client-reporting process, a longer auditor 
testing period. substantive test detection, and the number of MW identified become more impor- 
tant in classifying a given ICD as severe. Simply put, when objective evidence from a misstate- 
ment is not available, auditors use other evidence to support the classification decision. 

In summary, our results support the value of auditor involvement at two stages of the ICFR 
assessment process (detection and classification), and contribute to understanding of factors asso- 
ciated with client and auditor performance in both stages. The following section provides back- 
ground and contribution of this study, and develops our research questions and specific expecta- 
tions. Section III discusses the study's methods, and Section IV contains results of models and 
other analyses. Section V presents the study's conclusions and limitations. 


IL BACKGROUND, PRIOR RESEARCH, AND DEVELOPMENT OF RESEARCH 
QUESTIONS 
Internal Control Regulation in the U.S. and Prior Related Research 


SOX was implemented in response to numerous large-scale frauds and financial restatements 
among public companies. Among its key provisions is Section 404, which is based on the principle 
that improving ICFR should increase information reliability and restore investor confidence. This 
provision requires large public companies and their auditors to attest to ICFR effectiveness. Au- 
diting standards prior to SOX included an assessment of internal controls as an element in obtain- 
ing the required understanding of the audited company and its environment.’ Under those stan- 


An assessment of the control environment as a basis for risk assessment and planning is required in SAS No. 55 (AICPA 
1988). SAS No. 78 (AICPA 1997) builds on that standard by incorporating the five components of the COSO Frame- 
work. Under AU 325, auditors were required to disclose observed "reportable conditions" (including material weak- 
nesses in controls) to management and the audit committee or equivalent, but such communications were not made 
public. In 2003, the PCAOB adopted the AICPA Standards issued to date as interim standards. 
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dards, auditors had the option of relying on controls that were tested and found effective, or 
primarily using substantive procedures to support the financial statement audit opinion. Under 
prior standards, few auditors were familiar with documenting and testing of controls at the level 
expected by Section 404, using the COSO framework, and with the level of assurance required to 
issue a report on control effectiveness (see also KS; Curtis et al. 2009). 

Prior research uses publicly available annual report data to distinguish characteristics of 
companies disclosing ineffective controls (i.e., at least one MW) under Section 404, or quarterly 
management reports under Section 302 (Ashbaugh-Skaife et al. 2007; Doyle et al. 2007; Hoitash 
et al. 2009). However, archival research does not address the full extent of detected control flaws, 
how those problems are detected, or how auditors determine which problems are disclosed. For 
example, KS use publicly available data on misstatements mentioned in Section 404 of the annual 
reports to infer that most companies would have disclosed ineffective controls (і.е., at least one 
MW detected) if a substantive approach were used. Because studies using publicly available data 
cannot directly examine detection method, however, research with access to control-level data is 
important. Further, studies using public reports have an incomplete data set from which to draw 
conclusions on detection method or the auditor's role, as only disclosure of unremediated MW is 
required. Evidence on auditors' severity classification decisions from behavioral experiments high- 
lights the judgmental nature of these decisions. Earley et al. (2008) find that clients’ preliminary 
severity classifications influence auditor classifications, which is especially problematic when 
clients propose low severity. Also, Wolfe et al. (2009) show that management persuasion tactics 
can affect auditors' classification of information technology controls. 

In general, prior research implies that the component tasks of ICD severity classification 
(judging misstatement likelihood and materiality) are unstructured, complex, and difficult (e.g., 
Messier et al. 2005; Allen et al. 2006). One source of difficulty is interpretation of phrases such as 
“more than remote" and “material,” derived from guidance on financial statement materiality 
(SAB No. 99; SEC 1999a) and accounting for contingencies (SFAS No. 5; FASB 1975). Another 
is inconsistency in definitions. In defining MW, the PCAOB follows the FASB’s definition of 
“material” (SFAC No. 2, 4132) as something that would change the judgment of a reasonable 
person relying on the information. Yet “significant” in the title of SD generally means something 
that is important, consequential, and large. Thus, auditing standards imply that an ICD can be 
important (thus a SD) but not affect users’ decisions. Both issues lead to “ambiguity in the 
definitions” (Leone 2007, 227) of SD and MW, allowing latitude for company management in ICD 
classification. This ambiguity makes it difficult for auditors to judge the “right” ICD classification, 
and to justify that decision to client personne] whose incentive is not to report MW or SD. But to 
adhere to professional standards in a climate of liability concerns and PCAOB inspections, the 
auditor’s incentive is to detect existing flaws in ICFR and bring them to the attention of client 
management, the audit committee, and the public if necessary. This study provides the first archi- 
val evidence of ICD detection and severity classification. In the following sections, we develop 
our research expectations from prior literature, auditing standards, and the economic context of the 
early years of Section 404 implementation. 


Research Expectations 


ICD Detection 

We first focus on ICD detection, expecting that detection quality increases with expertise and 
independence of the parties performing the work. We expect that auditors will detect more ICD 
than clients, as they are independent from clients and have more knowledge of internal controls 
derived from education, firm training, and experience. We expect more detection among compa- 
nies using Section 404 consultants, who are hired for their expertise and, in addition, have more 
independence from management than client personnel (although less than the external auditor). We 
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expect a greater impact when the consultant is from a large auditing firm due to significant training 
and guidance received in those firms. The relative knowledge of consultant personnel of other 
entities is less clear, as smaller firms likely did not institute the degree of training used by larger 
firms. In terms of relative knowledge/independence, we expect that companies with internal audit 
functions will outperform companies with no internal audit function. 

In addition to these indicators of who is performing the client’s control documentation and 
testing, we also consider the level of independence built into the client’s process design. If com- 
pany management is not involved in the reporting chain, then there is less opportunity for man- 
agement influence. Thus, we expect a positive association of Section 404(a) process independence 
with client detection of ICD. We also consider the impact of integration of information technology 
(IT) in the Section 404(a) process, but do not predict a sign. While clients utilizing more advanced 
technology may detect more contro] flaws, those employing more IT may also have better under- 
lying systems (і.е., fewer manual controls). Finally, we anticipate that a longer Section 404(a) 
testing period will result in better ICD detection. 

We also investigate the nature of ICD detected by clients, an issue with important implications 
for understanding which contro! flaws are likely to be missed under the current regulatory struc- 
ture for non-accelerated filers, which does not require auditor involvement. We expect that clients 
will be less likely to discover more severe problems, due to relative lack of knowledge about 
controls and the inherent objectivity challenge of “self-assessment.” The issue of independence 
also arises in connection with client detection of entity-level ICD. Because many entity-level 
problems occur in centralized functions that directly involve top management, we expect that 
company personnel will be less likely to recognize flaws in controls at that level, relative to key 
Control Activities in areas such as sales and purchases. 


ICD Severity Classification 

Client characteristics and Section 404 processes. Our second summary research question 
concerns factors affecting auditors’ severity classification decisions. We first consider client char- 
acteristics. Specifically, we expect higher ICD severity classifications in companies with lower 
overall quality of internal control? In companies with good controls, an individual ICD is an 
isolated incident. However, in a weak-control context, auditors may consider a given ICD to be 
more severe, due to the potential for interaction with other control flaws and the potential for less 
effective compensating controls. We derive evidence on the overall controls context from the 
auditor’s control reliance decision. We also consider the number of MW detected as an indicator 
of pervasive and serious problems in ICFR specifically (see also Hoitash et al. 2008; Hermanson 
and Ye 2009). 

Our second severity expectation is that classifications will vary based on strength of the 
client’s Section 404(a) processes. Consultants’ greater expertise and independence could lead to 
better recognition of actual ICD severity, but these qualities could also result in remediation of 
more severe problems prior to year-end. Thus, we do not predict a directional effect for large firm 
or other Section 404 consultants. In addition, we expect that client Section 404(a) process inde- 
pendence will be positively associated with severity, as results of Section 404(a) activity will not 
be filtered through company management, which has the incentive not to report SD or MW. 

We also consider tbe role of internal audit, proposing a nondirectional expectation. On the one 
hand, internal audit personnel may be more adept at recognizing ICD severity than other client 
personnel, if the internal audit unit has been involved in controls design. However, companies with 


5 Prior research (Ashbaugh-Skaife et al. 2007) tests the association of control context with MW disclosure indirectly, e.g., 
using size and financial position to proxy for investment in controls. We build on that study by isolating severity 
classification decisions and using more precise measures. 
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an internal audit could have less severe control flaws (Felix et al. 2001; Lin et al. 2011). We also 
consider the extent to which IT is integrated into Section 404(a) activity. Better use of IT could 
result in detection of more severe problems, as documentation and testing of controls will be more 
complete and accurate than a manual process. However, companies with greater use of IT for 
Section 404(a) likely also have better use of IT throughout their operations and, hence, may have 
better controls. Thus, we also do not predict a sign on this variable. | 

Auditor Section 404 processes. The auditor’s Section 404(b) processes could also affect 
severity classification. First, we expect that severity classifications will be higher for auditor- 
detected ICD. Given their greater knowledge and objectivity, auditors should detect more serious 
control flaws than clients, and should be more confident of the severity implications of those they 
detect. The second factor in this category is the length of the auditor’s testing process. During our 
sample period, anecdotal evidence from audit partners noted concern that clients’ delay in com- 
pleting their work could affect the quality of the auditor’s process that follows. Further, due to 
constraints in the audit labor market during our study period, firms with heavy concentrations of 
accelerated filer clients struggled to hire and retain sufficient numbers of qualified employees to 
perform Section 404 procedures. Thus, an earlier start for auditor control testing should improve 
engagement teams’ ability to fully consider and properly classify ICD. 

Third, we consider the auditor’s source of evidence. Specifically, we expect higher severity 
classifications associated with detection of ICD through substantive tests, and with an existing 
misstatement associated with the flawed control. Substantive tests are designed to detect account 
discrepancies that are the basis for proposed audit adjustments. If a monetary discrepancy has been 
detected, then evidence of ICD severity is stronger with respect to the likelibood criterion. Also, an 
actual misstatement amount creates a “floor” for the quantitative effect of the control deficiency, 
even though the potential severity could be greater than the amount observed. The presence of a 
misstatement is important, given the widespread use of quantitative criteria for determining ma- 
teriality (e.g., Ng and Tan 2003; Messier et al. 2005). The association of misstatements with 
severity of control flaws is important to the argument proposed by KS that MW for many com- 
panies can be detected by tracing back from misstatements. If detection is through control tests 
and no associated misstatement has been detected, then the auditor’s judgment in the Section 404 
context becomes more complex—s/he must consider whether financial statement users’ decisions 
could be affected by a possible future misstatement arising from a recognized ICD, without benefit 
of an actual current misstatement available as an example. 

Prior research (Kennedy et al. 1997; Nelson et al. 2002; DeZoort et al. 2003) supports a 
positive association of misstatements with severity classification, by finding that objective evi- 
dence is important in justifying auditor decisions in unstructured tasks. These studies suggest that 
tangible evidence of control failure will improve auditors' ability to defend ICD classification 
decisions to management and audit committees, as well as in subsequent internal firm and PCAOB 
inspections. But how do auditors make severity classification decisions in the absence of a mis- 
statement? We expect that in this case, stronger client and auditor processes will help bolster the 
auditor's position in discussions with management and the audit committee over ICD classifica- 
tion. Thus, process features such as an independent reporting chain and a longer auditor testing 
period could be more significant when a misstatement has not already occurred. 

ICD type. We next consider whether severity classifications vary according to the nature of 
the ICD. AS No. 2, {133 notes that the auditor should consider the nature of the affected financial 





$ Motivating study of differential classification according to nature of the control flaw, professional standards on Section 
404 require specific consideration of azcounts and assertions (AS No. 2, paras. 68-69). While past studies find little 
difference in control risk assessments across accounts/assertions (e.g., Elder and Allen 2003), current standards suggest 
that we could find systematic differences in severity classifications across types of ICD. | 
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statement accounts in assessing the likelihood of misstatements that may result from an inopera- 
tive control. AS No. 2, 950-53 focus оп entity-level controls with pervasive effects, including IT 
general controls (ITGC), controls over the financial reporting close process, and COSO categories 
of Control Environment, Monitoring, and company Risk Assessment. While some studies address 
the importance of Control Environment in control risk assessments (e.g., Cohen and Hanno 2000; 
Cohen et al. 2002), prior research has not directly examined how entity-level status affects audi- 
tors' severity classification in the Section 404 context. 

Among account-specific ICD, we expect revenue accounts to be considered more severe. The 
SEC's strong focus on revenue recognition in the period immediately preceding the enactment of 
SOX (e.g, SAB No. 101; SEC 19995) heightened auditors’ awareness of the importance of 
controls over revenue recognition. We also predict a positive association of tax ICD with severity. 
In response to concerns about auditor independence, the SEC prohibited auditors from performing 
certain accounting services following passage of SOX, including preparation of the tax accrual. 
The transition away from auditor assistance to companies in tax accrual accounting to meet 
SEC/PCAOB independence requirements suggests higher likelihood of a material misstatement in 
tax accounts during our study period. 

Professional guidance and some prior research on the second component judgment, material- 
ity, also suggests differential classification according to the nature of the account. The definition of 
materiality in professional auditing standards is based on the auditor's judgment that an item 
would make a difference in users' decisions. AS No. 2, 123 notes that the auditor should consider 
both quantitative and qualitative elements, as in the financial statement materiality judgment (SAB 
No. 99; SEC 19992). Qualitative factors noted in the standard include the nature of accounts and 
assertions, and future misstatement consequences. Prior research mainly considers materiality 
judgments in audit planning and audit adjustments. Summarizing this literature, Messier et al. 
(2005) conclude that during planning, the primary criterion for materiality in practice is income- 
based, although other criteria are considered. As noted above, judgments of materiality are likely 
more difficult when an existing misstatement is not available to provide a gauge for the likely 
amount of future misstatements resulting from the ICD. 

In sum, this research builds on prior studies of publicly disclosed Section 404 MW by 
investigating detection and severity classifications of ICD discovered by companies and auditors. 
In the following section, we discuss our procedures and methods of testing expectations. 


Ш. METHOD 

Sample and Data Collection Procedures 

We obtained data from several large audit firms, under confidentiality agreements that restrict 
us from knowing client identities and reporting separate results by firm or firm size. То guide 
sample selection, we asked that each firm randomly select from 2004—2005 engagements of 
smaller accelerated filers (with revenues of about $1 billion or less) in non-regulated industries. 
The restriction on client size is to improve our ability to generalize to the more than 10,000 
smaller U.S. public companies not subject to Section 404(b) and to listed companies in other 
countries. Contact personnel from participating firms helped develop a spreadsheet to be com- 
pleted by engagement teams, containing both company-level and control-level information. 


Variable Definitions and Coding 

Table 1, Panel A describes company-level variables representing auditors' and clients' Section 
404 processes and outcomes, and Panel B describes control-level variables. We measure contro] 
quality using ІС RELIANCE, the auditor's level of reliance on internal controls. This is an indi- 
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cator variable equaling 1 if “strong” or “maximum,” and 0 otherwise.’ CONSULTANT_BIG6 
equals 1 if the client outsourced all or part of its Section 404(a) activity to one of the largest six 
accounting firms, and 0 otherwise. CONSULTANT_OTHER equals 1 if the client outsourced its 
Section 404(a) activity to a smaller accounting firm or other provider. REPORT_INDEPENDENT 
equals 1 if Section 404(a) results are reported to the audit committee and auditor independent of 
client management, and 0 otherwise. INTERNAL_AUDIT equals 1 if the company has an internal 
audit function, and 0 otherwise. We capture the auditor’s assessment of the company’s effective- 
ness of integrating IT using a five-point scale (T_INTEGRATION). MONTHS PRIOR CLIENT 
and MONTHS PRIOR AUDITOR measure the number of months before the balance sheet date 
that clients (auditors) began tests of controls. Table 1, Panel A also describes measures of the 
number of ICD at each severity level per engagement: NUM. MW, NUM D, and NUM DEF Тог 
MW, SD, and deficiencies, respectively. 

Control-level variables are described in Table 1, Panel B. Column headings indicate whether 
the ICD was remediated by year-end, and if not, the auditor's final severity classification. If the 
ICD was detected by the client, then the instrument asks for the client's preliminary severity 
classification, if known. DETECTED AUDITOR equals 1 if detected by the auditor (or auditor 
and client jointly). We also asked firms to indicate the phase of the audit in which the ICD was 
detected (planning, internal control assessment/testing, substantive testing, or review). DETECT- 
ED_SUBS equals 1 if the ICD was detected by substantive tests, and 0 otherwise; DETECTED- 
_CONTROL equals 1 if detected by substantive tests, and 0 otherwise. If the ICD was associated 
with a misstatement, then MISSTATEMENT equals 1, and 0 otherwise? We developed additional 
coding rules from professional sources, verified by contact personnel at the firms providing data. 
We classified each ICD by the COSO components: Control Environment, Monitoring, Risk As- 
sessment, Information/Communications, and Control Activities. We use AS No. 2 to determine 
which ICD are entity-level; i.e., those in the first four COSO categories (ENTITY. CONTROLEN- 
VIR, ENTITY. MONITORING, ENTITY. INFOCOM, and ENTITY. RISKASSESS), IT general con- 
trols (ENTITY. ITGC), and general ledger controls (ENTITY. GL). The indicator ENTITY equals 1 
if the ICD is in any entity category, 0 otherwise. '? 

When applicable, ICD are classified by financial statement accounts/cycle, based on the 
COSO (2006) small business framework and guidance from some participating firms. These are: 
ACC PURCHASES, ACC REVENUES, ACC COMPENSATION, ACC FIXEDINTANG (fixed 
and intangible assets), ACC TAXES, ACC. LIABILITIES (including leases, warranty and legal 
liabilities), and ACC. TREASURY (including investments and bank account management). We also 
classified each ICD by the nature of the control weakness, with categories derived from COSO and 


Control reliance was assessed using the following scale: 5 = maximum reliance; 4 = significant reliance; 3 = some 
reliance; 2 = no reliance; 1= not applicable. Information on current-year reliance is missing for ten engagements. То 
enable use of this variable, we construzt an indicator equaling 1 for maximum and significant reliance, and 0 otherwise. 
Measurement error from collapsing categories should bias against finding an effect of control reliance. 

While auditing standards indicate that auditor testing should follow client testing within a given area, engagement teams 

indicated that for 37 ICD (1.3 percent), discovery was made by client and auditor jointly. Assuming that the clients 

might have missed these ICD without auditor involvement, we classify them with auditor-detected ICD. 

? Asking auditors to code associated misstatements was suggested by personnel of one of the participating firms, and so 
this question was asked of some but not all engagement teams. For remaining engagements, the authors identified 
associated misstatements from the text of the ICD description. If, for those engagements, the text did not mention a 
related misstatement when one existed, then the ICD would have been classified as not misstatement-associated. This 
should bias against finding differences between subsamples with/without associated misstatements. 

10 We classify control flaws related to IT general controls (ТРОС) based on AS No. 2 and guidance of the IT Governance 

Institute (2004) as system security, operations, development. and change process controls. General ledger issues relate 

to the following processes: financial statement close process, adjustments and accruals, allocations (e.g., depreciation 

and amortization), financial statement preparation, nonroutine entries, general ledger maintenance, SEC filings, and 
foreign currency. 
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AS No. 2: DESIGN (the control does not exist or cannot meet its objective); DOCUMENTATION 
(the control exists but is not documented; INEFFECTIVE (the control exists but does not func- 
tion); INSUFFICIENT_TESTING (the control exists, but has not been adequately tested, perhaps 
due to insufficient time prior to year-end)."! 

Table 2 provides names and definitions for company-level control variables, based on prior 
research showing that MW disclosure is associated with company size, financial condition, com- 
plexity, and earnings quality (e.g., Ashbaugh-Skaife et al. 2007; Doyle et al. 2007 ).!? We measure 
company size using the natural log of total assets (LNASSETS), and financial condition using total 
liabilities divided by total assets (LEVERAGE) and cash flow from operations (CFOTA). We also 


TABLE 2 


Sample Composition and Descriptive Statistics on Company-Level Control Variables 
Mean (Std. Dev.) or Percent = 1 


All Effective Ineffective 
Engagements Controls Controls 
(n = 76) (n = 61) (n = 15) 
LNASSETS = natural log of assets. 19.52 19,54 19.43 
(116) (0.95) (1.21) 
LEVERAGE = total liabilities/total assets. 0.43 0.44 0.40 
(0.22) (0.22) (0.17) 
СРОТА = cash flow from operations/total —0.150 —0.012 —0.28 
assets. (0.19) (0.20) (0.24) 
ACCRUALS = residuals from a regression of 0 —0.007 0.027 
а а: (0.09) (0.10) (0.05) 
ARINVIA = (accounts receivable + inventory)/ 0.29 0.27 0.36 
total assets. (0.18) (0.18) (0.18)** 
SEGMENTS = number of business segments. 2.51 238 3.07 
(2.32) (2.01) (3.35) 
LOCATIONS = number of significant 4.87 4.84 5.00 
locations. (5.10) (5.37) (3.95) 
NEW_PUBLIC = 1 if client has been a public 30.3% 24.6% 53.34%** 
company less than five years, and 0 
otherwise. 
INDUSTRY. MFG = 1 if client's industry is in 56.6% 55.7% 60.0% 
the manufacturing sector (one-digit SIC = 2 
or 3), and 0 otherwise. 
ЕҮЕ, 2004 = 1 if fiscal year is 2004, and 0 50.0% 47.5% 60.0% 


otherwise. 


ЖЕ Indicates <0.05, significant effects in two-tailed tests. 

This table presents descriptive statistics for company-level control variables, for the entire sample and for engagements 
with effective/ineffective ICFR. We test differences between companies with effective/inefiective ICFR using t-statistics 
for continuous variables and ү? tests (d.f. = 1) for dichotomous variables. 


! To assure coding reliability, we used a’ practice sample of 73 ICD in two engagements, identified issues that reduced 
agreement, and refined our coding rules. We then independently coded 625 ICD from four engagements, with 97.2 
percent agreement. The remaining ICD were coded by one or the other author. | 

12 To protect client identities, financial information was disguised by engagement teams through multiplying by a factor 
between 0.95 and 1.05. 
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construct a measure of earnings quality by regressing net income on cash flow from operations 
within industry categories. The residuals from this regression constitute the variable 
ACCRUALS.” We measure complexity as the ratio of accounts receivable and inventory to total 
assets (ARINVTA), the number of reportable business segments (SEGMENTS), and the number of 
significant locations (LOCATIONS). We also include indicator variables for companies that have 
been public fewer than five years (NEW_PUBLIC), manufacturing companies 
(INDUSTRY_MFG),"* and fiscal year using FYE_2004. 


Models 


We address our first research question in two ways: (1) by examining the overall percentage 
of ICD detected by clients/auditors; and (2) by modeling factors associated with likelihood of 
client detection. The latter analysis uses Model 1, a logistic regression model whose dependent 
variable is DETECTED_CLIENT, which equals 1 if the ICD was detected by the client, and 0 if 
detected by the auditor or auditor/client jointly. З Because we expect that clients are less likely to 
detect severe ICD, we expect negative coefficients on indicator variables SD, MW, and ENTITY. 
Because greater knowledge and independence from management of the parties performing the 
work should result in higher likelihood of client detection, we expect positive signs on CONSULT- 
ANT_BIG6, CONSULTANT. OTHER, REPORT. INDEPENDENT, and INTERNAL AUDIT. Сот- 
panies with better ГГ. INTEGRATION might perform better in ICD detection, but have better 
underlying controls, so we do not predict a sign on that variable. We expect a positive sign on 
MONTHS, PRIOR CLENT, as a longer Section 404(a) process should help clients detect more 
ICD. We also expect a positive sign on LNASSETS, as larger companies should have more re- 
sources to devote to the process. 

We investigate severity classification using two methods. First, because auditors are some- 
times aware of the client's preliminary classification of ICD, we first test whether auditors override 
those classifications by judging ICD to be more severe. Second, we use two logistic regression 
models to test expectations regarding factors associated with severity classification of ICD, fol- 
lowing the two-stage process implied in the definitions of SD and MW in AS No. 2 and explicitly 
featured in the profession’s white paper (BDO Seidman LLP et al. 2004). Model 2 (whose 
dependent variable is SD/MW; 1 = MW or SD, 0 = Deficiency), investigates factors associated 
with ICD meeting the criterion for a “more than remote” likelihood of failing to detect or prevent 
a misstatement (.е., at least SD classification, implying that the ICD must be reported at least to 
management and the audit committee). Model 3 is tested on the subsample of ICD having met the 
likelihood criterion, investigating factors associated with the auditor’s judgment that misstate- 
ments resulting from the ICD could be material (which determines public disclosure of ineffective 
ICFR). The dependent variable of Model 3 is MW (1 = MW, 0 = SD). 

In Models 2 and 3, we expect more severe ICD among clients with lower control quality. 
Control quality is measured using /C RELIANCE and the number of MW per engagement (due to 


13 Data limitations prevent us from estimating models to separate normal from abnormal accruals. 

14 Our sample comprises public entities in non-regulated industries. Most sample companies are in manufacturing (SIC 
codes ranging from 2000 to 3999; 56.6 percent). Because for 19 of our 76 engagements, engagement teams did not 
provide a SIC code, we construct an indicator variable comparing manufacturing to all other industries. 

15 Because there are multiple ICD for each company, all models are estimated with clustering on engagement ID (i.e., 
company/year), producing robust standard errors. Clustering is an important feature of our empirical tests, but it causes 
estimation difficulties when using some variables in smaller subsamples, due to insufficient numbers in one or both 
categories of the dependent variable. In that case, we omit as few variables as possible to enable estimation. 

16 Use of two models allows coefficients to vary across the two stages. To investigate whether this approach is statistically 
indicated, we estimated an ordered probit with a three-level dependent variable, and applied the Brant test for propor- 
tional odds. This test is significant (p < 0.01), indicating that the parallel regression assumption is violated and the 
two-model approach is appropriate. 
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skewness, we use NUM MW SQRT, the square root of the number of MW detected). We also 
investigate the association of ICD severity classification with client process measures." We do not 
predict a sign on either CONSULTANT variable, as consultants may detect more severe control 
flaws, or enable earlier identification and remediation of severe flaws. We expect the sign on 
REPORT INDEPENDENT to be positive, as it implies that Section 404 results are not filtered by 
management's incentives to classify ICD as less severe. We do not propose a sign on 
INTERNAL AUDIT. Companies with an internal audit function might have less severe control 
flaws prior to Section 404 detection processes, and might also be better able to recognize severity 
than client personnel without internal audit experience. The sign on ГГ INTEGRATION is also 
indeterminate. Companies with effective IT integration in the Section 404 process could perform 
better at detecting severe ICD. However, those companies likely also have better IT controls in 
general, meaning fewer severe problems should exist. 

Models 2 and 3 include four auditor processes variables. We expect positive coefficients on 
DETECTED AUDITOR (given auditors’ greater knowledge and independence) and on 
MONTHS. PRIOR AUDITOR (a longer auditor testing period provides more time to detect serious 
control flaws and consider their implications for financial reporting). We expect a positive sign on 
MISSTATEMENT, as a current misstatement provides objective evidence for assessing the likeli- 
hood and materiality of future misstatements. We also expect higher severity classifications for 
ICD detected with substantive tests (DETECTED SUBS). 

Regarding ICD type variables, we expect that entity-level ICD will be more severe, with one 
exception. Professional guidance during our study period (BDO Seidman LLP et al. 2004) indi- 
cates that regardless of the criteria of AS No. 2, auditors should classify ITGC consistent with 
underlying process controls. Thus, we do not propose a directional expectation on ENTITY. ITGC. 
Also, relative to the ICD in control documentation that remain in tbe intercept, we expect positive 
signs on DESIGN, INEFFECTIVE, and INSUFFICIENT. TESTING, as documentation problems 
should be more easily remediated than other types of control flaws. 

We derive our predictions for company-level control variables from prior research on factors 
associated with public disclosure of MW (e.g., Doyle et al. 2007; Hoitash et al. 2009). We expect 
a negative sign on LNASSETS, as prior research shows smaller companies have more serious 
internal control problems. We expect that poor financial health will be associated with more severe 
ICD, generating a positive sign on LEVERAGE and a negative sign on cash flow from operations 
(CFO). We expect an association between ICD severity and ACCRUALS, based on prior research 
showing earnings quality to be lower among companies with internal control flaws (e.g., 
Ashbaugh-Skaife et al. 2009). We also expect that company complexity will be positively associ- 
ated with ICD severity, including АКІМУТА, SEGMENTS, and LOCATIONS. More severe ICD 
should be found among new public companies (МЕУ PUBLIC). These models also control for 
industry membership using INDUSTRY. MFG, and for fiscal year using FYE 2004. 


IV. RESULTS 
Description of the Sample 


The sample comprises 3,990 specific ICD detected in 76 engagements for 44 companies. An 
auditor's Section 404 opinion of ineffective ICFR was given for 15 sample engagements (19.7 


17 Companies’ choices of Section 404 processes (e.g., to use a Section 404 consultant) could te endogenous. Prior research 
does not address factors associated with these choices; thus, there is little guidance in the literature on what measures 
could be used as instruments. Ávailable variables within our sample are few, and none fit the criteria for effective 
instruments: a strong tbeoretical link, statistical association with the selection variable, and no association with the 
outcome variable (severity classification). In such a situation, Larcker and Rusticus (2010) advise against formal 
modeling of selection, as greater bias can be introduced by use of poor instruments. 
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percent, 28 Untabled analysis shows that an additional 51.4 percent of sample companies have at 
least one SD, implying that only 28.9 percent had no ICD of sufficient severity to be reported, at 
least to the audit committee. At the individual control level, 153 are MW (3.8 percent of detected 
ICD), a mean (median) of 10.2 (2.0) MW per company with ineffective controls. Most companies 
with ineffective controls have few MW, but some have a large number of ICD that auditors 
identified as MW or components of MW. However, this does not imply that each of these indi- 
vidual control flaws would be disclosed separately in Section 404 reports. Such disclosures vary in 
their level of specificity; for instance, individual deficiencies might be aggregated for discussion in 
the text of the opinion (e.g., several control flaws in revenues or several issues relating to the 
Control Environment). SD comprise 11.6 percent of sample ICD, and Deficiencies 58.4 percent, 
while 25.7 percent were remediated as of the balance sheet date. Details on ICD type by severity 
level are shown in Table 3.’ 

Table 2 presents descriptive statistics for company-level variables, and tests differences ђе- 
tween subsamples with effective versus ineffective controls. The mean log of total assets is 19.52 
(the mean in dollars is $502.5 million), mean leverage is 0.43, and mean CFOTA is —0.150. The 
mean of ACCRUALS (the residual of regressing net income on cash flow from operations) is zero, 
by construction. The mean ratio of ARINVTA is 0.29; this ratio is larger for companies with 
ineffective ICFR (0.36 versus 0.27, p = 0.048). On average, sample companies have 2.51 seg- 
ments and 4.87 locations. The overall percentage of companies that have been public fewer than 
five years is 30.3 percent; companies with ineffective controls are significantly more likely to be 
newly public (24.6 versus 53.3, p = 0.030). Over half of the companies (56.6 percent) are in the 
manufacturing industry. 


Descriptive Statistics on ICD Detection 


Table 1, Panel A describes company-level Section 404 process variables. Panel A shows that 
control reliance is higher in engagements with effective controls (54.1 versus 26.7 percent, 
p = 0.057). Consultants from large auditing firms were used for Section 404 activity by 13.2 
percent of sample companies, while 63.2 percent used a smaller auditing firm or another 
provider.” Results of the client's Section 404 activity were reported independent of management 
in 34.2 percent of the sample. Thus, in about two-thirds of sample companies, Section 404 results 
were filtered through management before being made available to the auditor. Just under half of 
the companies (46.1 percent) have an internal audit unit. Companies with ineffective controls have 
a lower mean JT. INTEGRATION than those with effective controls (2.93 versus 3.39 on the scale 
of 1 — highly ineffective to 5 — highly effective, p — 0.035). Clients (auditors) began control tests 
an average of 5.82 (4.21) months before year-end,” Panel A also shows that the mean number of 
MW among engagements with ineffective controls is 10.20, while the median is 2.01, indicating a 


18 Bedard et al. (2007) note that the overall proportion of smaller accelerated filers with ineffective controls under Section 
404 is 20 percent in 2004 and 14 percent in 2005. Our sample proportions of companies with ineffective ICFR in 2004 
(2005) are 23.7 (15.8) percent, not significantly different from the Bedard et al. (2007) disclosure proportions (p = 0.345 
and p = 0.444, respectively). 

19 For efficiency, we do not present a matrix of correlations among independent variables. Estimating the severity models 
using regression to obtain variance inflation factors, we find that the highest із 4.42 (4.41) for NUM_MW_SQRT in 
Model 2 (3), lower than the level of 5 often used to indicate concern for multicollinearity. 

20 Untabled statistics show that the consultant managed the work in 35.1 percent of companies using a consultant, 
performed controls documentation in 68.4 percent, and controls testing in 86 percent. 

21 There were numerous reports in the business press in the first year of Section 404 implementation of concern that some 
companies delayed their start of testing too long to do an effective job. In supplemental analysis, we investigate whether 
the client’s Section 404 testing began earlier for 2005 engagements in our sample than for 2004 engagements. In 2004, 
client testing began 5.28 months before year-end, while in 2005 client testing began 6.34 months prior to year-end. This 
difference is significant (1 = 2.087, р == 0.020). 
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highly skewed distribution. The mean number of SD for companies with ineffective (effective) 
controls is 18.27 (3.07), a significant difference (p = 0.030). The overall mean number of Defi- 
ciencies is 30.63. 

Table 1, Panel B shows that auditors detected 72.1 percent of ICD. This statistic addresses our 
first research question, showing that auditors detect a large proportion of ICD even though clients 
are both documenting and testing controls. This percentage differs among severity classes of 
unremediated ICD (p = 0.001); it is highest for MW (84.3 percent) and lowest for deficiencies 
(74.7 percent). Panel B also shows that the percentage of misstatement-associated ICD increases 
with severity (p < 0.001), as does the percentage detected with substantive tests (p < 0.001). The 
percentage of ICD detected by control tests decreases with severity (p = 0.042). 

Table 4, Panel A provides further evidence on detection method by documenting the types of 
ICD detected by auditors with substantive (control) tests. Column 1 shows that 81.5 percent of 
unremediated entity-level ICD are initially detected through control tests, while only 15.1 percent 
are detected through substantive tests. Within entity-level categories, control test detection rates 
are above 80 percent for all except general ledger controls, which include adjusting and closing 
entries. Account-specific ICD are also more likely to be detected though control tests, with tax 
accounts having the lowest of the listed percentages (64.8 percent overall). The only ICD type for 
which substantive and control test detection is roughly equivalent is misstatement-associated (45.0 
versus 44.1 percent). Panel B shows that the percentage of misstatement-associated ICD detected 
with control tests is lower than for other ICD (p < 0.001). Columns 2 and 3 show a pattern of 
decreasing control test detection and increasing substantive test detection with increasing severity. 
Panel B shows that overall, the percentage of ICD detected with control tests decreases with 
severity (p < 0.001). Overall, this analysis shows that while more severe ICD are less likely to be 
detected through control tests, control tests still provide initial evidence for most MW. These 
results based on control-level data cast doubt on the effectiveness of relying on misstatements 
discovered by auditor substantive procedures to detect key flaws in controls over financial report- 
ing, as suggested by KS from company-level data. 


Client ICD Detection and Under-Classification 


Table 5 presents results from estimating Model 1, examining factors associated with client 
ICD detection, for all unremediated ICD and for SD/MW only, along with descriptive statistics on 
model variables. Results for all unremediated ICD show that clients are less likely to detect ICD 
ultimately classified as MW or SD (both at p = 0.011) as well as entity-level ICD (p = 0.024). 
Client detection is more likely for larger clients (p = 0.005) and those using a large audit firm 
consultant (p — 0.001). However, independent reporting chain, internal audit function, IT integra- 
tion, and longer client testing process are not associated with better client detection. Among 
SD/MW only, a large audit firm consultant again improves detection (p = 0.005) and entity-level 
ICD are less likely to be detected by clients (p = 0.038). In sum, Model 1 shows that clients are 
less likely to detect more severe and pervasive ICD, but use of large audit firm consultants and 
greater resources improves client detection. 

According to AS No. 2, auditors should be aware of ICD detected by clients before beginning 
work in a given area of the audit, Because the auditor may also know the client’s preliminary 
severity classification, we compare client and auditor ICD classification in these situations. Table 
6, Panel A shows that 71.4 (64.8) percent of client-detected MW (SD) were initially under- 
‘classified by clients. These percentages are both significantly different from chance (p = 0.005 
and p = 0.023, respectively), implying that auditors tend to override client classifications in a 
significant number of cases. Client under-classification could be due to lack of knowledge of the 
impact of specific controls, or an attempt to influence the auditor’s classification to a less severe 
level. 
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TABLE 4 
Supplemental Analysis of Detection Method for Auditor-Detected ICD 


Panel A: Percent Detected with Control (Substantive) Tests, by ICD Characteristics 


1. All 
Unremediated 2. SD and MW 3. MW 
Overall 82.1 72.7 66.7 
(14.3) (22.8) (28.7) 
Entity Level 81.5 74.1 63.3 
(15.1) (22.3) (33,3) 
ENTITY_CONTROLENVIR 83.7 88.6 100.0 
(9.9) (2.9) (0.0) 
ENTITY MONITORING 83.3 77.3 60.0 
(14.5) (21.2) (40.0) 
ENTITY. GENERAL, LEDGER 69.7 55.8 56.8 
(26.2) (39.5) (37.8) 
ENTITY. ITGC 93.3 96.6 80.0 
(5.0) (2.6) (20.0) 
ENTITY. RISKASSESS 100.0 100.0 100.0 
(0.0) (0.0) (0.0) 
Account-Specific 19.1 66.1 64.4 
(17.3) (30.1) (32.7) 
ACC_PURCHASES 81.6 710 72.0 
(14.9) (23.4) (20.0) 
ACC_REVENUES 79.1 72.0 66.2 
: (16.5) (25.0) { (30.8) 
ACC_COMPENSATION 84.0 64.9 66.7 
(13.3) (35.1) (33.3) 
ACC_TAXES 64.8 46.9 40.0 
(29.7) (40.6) (60.0) 
Misstatement 45.0 30.6 35.9 
(44.1) (57.3) (48.7) 
Panel В: Tests of Significance of Difference in Detection Method by ТСО Characteristics 
Percent Detected 
with Control Tests Хил 
MW and SD 72.196 39.5*** 
Deficiencies 84.9% 
Misstatement Associated 45.0% 341.1*** 
All other ICD 88.2% 


ЖЕЖ Indicates significance at 20.01, 

This table presents supplemental analysis of detection method, among those ICD detected with auditor involvement, for 
which that information is available. 

Panel A shows percentages of control (substantive) test discovery for all unremediated ICD (Column 1), SD/MW (Column 
2), and MW (Column 3). Percentages may not add to 100 due to other possible methods of detection. 


Panel B provides tests of detection method by severity, and between misstatement and no-misstatement subsamples. 
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Table 6, Panel B presents supplemental statistics on client classification of ICD whose final 
classification is SD or MW, and results of estimating a logistic regression model whose dependent 
variable (UNDER CLASSIFICATION) equals 1 if the client's preliminary classification is lower 
than the auditor's final classification, and 0 otherwise. We expect that clients might not recognize, 
or might seek to downplay, more severe (МУУ) or pervasive (entity-level) problems. Consistent 
with this expectation, the coefficient on MW is positive and significant (p « 0.001). However, 
entity-level ICD are unexpectedly less likely to be under-classified (p — 0.035, two-tailed).” We 
also expect that clients with stronger Section 404(a) processes should be less likely to under- 
classify severity. Model results show that under-classification is less likely when the client has an 
independent reporting chain (p « 0.001), consistent with this expectation. Although Model 1 
shows that large firm consultants improve ICD detection, Table 6 shows that independence in the 
reporting process plays a greater role in accurate client severity classification.” 


Results of Estimating Models 2 and 3: Auditors’ Severity Classification Decisions 


Table 7 presents results on factors affecting auditors’ severity classification decisions, inves- 
tigated using Model 2 (distinguishing SD/MW from Deficiencies) and Model 3 (distinguishing 
MW from SD). These models have a Pseudo-R? of 0.352 and 0.437, respectively. 


Client Characteristics and Section 404 Processes 

Model 2 results show a negative and significant coefficient on ЇГС RELIANCE (р = 0.013) 
and a positive and significant coefficient on NUM MW. SQRT (p « 0.001). Both imply that, when 
the client's overall control quality is lower, an ICD is more likely to be classified as SD relative to 
a deficiency. Concerning Section 404 process strength, Model 2 shows a positive coefficient on 
use of a large accounting firm as consultant (p « 0.001), but not for other consultants. SD/MW 
classification is also more likely when Section 404 reporting is independent of management 
(p = 0.001). The coefficient on IT. INTEGRATION in Model 2 is negative and significant 
(p = 0.012), implying that better underlying IT controls are associated with less severe ICD. The 
presence of an internal audit function is not significant. Model 3 results show factors that contrib- 
ute to distinguishing MW from SD. Regarding the overall controls context, NUM MW SQRT is 
again significant (p « 0.001), but ІС RELIANCE is not. An independent reporting chain is also 
positive and significant in Model 3 (p — 0.037). 


Auditor Section 404 Processes 

Model 2 results show that as expected, ICD detected by auditors are more likely to be 
SD/MW (р = 0.008). The length of the auditor testing period is also positive and significant 
(p = 0.003), suggesting that auditors who take more time to consider consequences of ICD 
classify them as more severe. ICD detected with a current misstatement are more likely to be 
SD/MW (p < 0.001), as are those discovered through substantive tests (р = 0.074). In Model 3, 


21 While Model 1 shows that clients are less likely to detect entity-level problems, this result implies they are more aware 
of their significance when detected. 

З Although the descriptive statistics in Panel B include ІС RELIANCE, this variable is not included in the model, as its 
presence prevents estimation. This is likely due to the very low extent of under-classification among well-controlled 
companies. Table 5, Panel B shows that only 6.1 percent of ICD of clients of with higher control reliance are under- 
classified, relative to 89.5 percent of other clients. 

М Tn Model 1 we find that large accounting firm consultants are associated with better ICD detection, and in Model 2 we 
find they are associated with higher severity classifications. These results could be explained by a tendency for poorly 
controlled companies to hire Big 6 consultants, but we find no association of use of large firm consultants with internal 
control reliance (p= 0.555). Model 2 also shows that an independent reporting chain is associated with higher severity 
classifications, which could be due to poorly controlled companies designing their processes in that manner. However, 
we also find no association of independent reporting with internal control reliance (p = 0.869). Thus, the more likely 
explanation is that use of those processes leads to higher quality Section 404(a) outcomes. 
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TABLE 7 
Models 2 and 3: Control, Process, and Company Factors Associated with Severity 
Classifications 
Model 2: 
SD and MW versus Model 3 
Deficiencies MW versus SD 
Client Characteristics and Detection Processes 
IC RELIANCE (~) —0.945** 0.086 
TOTAL, MW. SQRT (+) 0.308 **** 0.763* ЖЖ 
CONSULTANT. BIG6 1.892*** -1.473 
CONSULTANT_OTHER 0.244 0.257 
REPORT_INDEPENDENT (+) 1.464*** 3.224** 
INTERNAL_AUDIT -0.217 2.266 
IT_INTEGRATION —0.527** 0.705 
Auditor Detection Processes 
DETECTED AUDITOR (+) 0.449*** 0.240 
MONTHS_PRIOR_AUDITOR (+) 0.324*** -0.680 
MISSTATEMENT (+) 0.849*** 0.636 
DETECTED_SUBS (+) 0.559* 0.991* 
ICD 
ENTITY. CONTROLENVIR (+) 0.434 2.603 
ENTITY. MONITORING (+) 0.228 –0.7441 
ENTITY_INFOCOM (+) -1,6127 
ENTITY_RISKASSESS (+) — 1.7391 
ENTITY GENERAL, LEDGER (+) 0.425** 0.433 
ENTITY_ITGC 0.445* —1.113* 
ACC_REVENUES (+) 0.933" ** 2.060*** 
ACC_TAXES (+) 0.851** 0.839 
ACC_PURCHASES 0.392 0.811 
ACC_COMPENSATION 0.191 
ACC. FIXEDINTANG —0.755* 
ACC. LIABILITIES —0.208 
ACC. TREASURY 0.496 
DESIGN (+) 1.183*** 0.993*** 
INEFFECTIVE (+) 0.760*** 0.928*** 
INSUFFICIENT _TESTING (+) 0.627 *** 0.817** 
Control Variables: Client Characteristics 
LNASSETS (-) —0.245 0.584 
LEVERAGE (+) -0.563 1.980 
CFOTA (-) 3.715 —6.717** 
ACCRUALS (+) 11.817*** —0.074 
ARINVTA (+) —4.915ttt 7.002** 
SEGMENTS (+) -0,249"" 0.031 
LOCATIONS (+) -0.009 —0.232 
NEW. PUBLIC (4-) 0.848 * ** 3.777#% 
FYE 2004 (-) -0.361 -0.093 
INDUSTRY MFG —0.979*** —1.488 
Constant 3.655 —22.423 
n 2,909 611 
Pseudo-R? 0.352 0.437 
(continued on next page) 
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TABLE 7 (continued) 


+ ЖЖ Жжж Indicate significant effects at <0.10,<0.05, and 0.01, respectively, probability levels are one-tailed for 
directional expectations indicated in the table. Values in the cells are model coefficients. 

Я, TH Indicate significance in the opposite direction to that predicted, at р 20.05 end p <0.01, respectively (two-tailed), 
This table shows results of estimating Models 2 and 3, logistic regressions estimated using clustering on engagement ID. 
Model 2 explains factors differentiating deficiencies from SD (i.e., the ICD meets criteria for disclosure at least to the audit 
committee), and Model 3 explains factors differentiating SD from MW (i.e., the ICD meets criteria for public disclosure). 
Independent variables are defined in Tables 1 and 2. A blank cell in Model 3 results indicates that a variable is not included 
because its presence prevents estimation, due to low representation. 


we again find that substantive test detection is associated with MW classification (p — 0.062). 
However, an associated misstatement does distinguish MW from SD, when controlling for the 
nature of the ICD and other variables in the models. 


ICD Types 

Our models also examine severity classifications of specific types of entity-level ICD. Model 
2 results show differing associations across types. Coefficients on general ledger controls 
(p = 0.036) and ITGC (p = 0.092) are positive and significant. However, COSO categories of 
Information/Communications and Risk Assessment are negative ard significant (p = 0.034 and р 
« 0.001, respectively, two-tailed), and Control Environment is not significant. In contrast, Model 
3 shows that location in the Control Environment is a strong factor in distinguishing MW from SD 
(p = 0.001). Monitoring ICD are less likely to be MW than SD (p = 0.054), as are ITGC 
(p 0.085). 

We also examine the severity of specific account-level ICD types. Both revenue and tax 
accounts have positive coefficients in Model 2 (p = 0.002 and p = 0.026, respectively), consistent 
with regulators' attention to those areas prior to and during the study period. The revenue category 
is also significant in Model 3 (р < 0.001), likely due to the high volume of dollars flowing through ` 
that account. Model 2 results on the nature of detected ICD show positive and significant coeffi- 
cients for design flaws (p « 0.001), ineffective controls (p « 0.001), and insufficient testing 
(p = 0.003). In Model 3, coefficients are again positive and significant for design flaws 
(p = 0.003), ineffective controls (p < 0.001), and insufficient testing (p = 0.047). 


Control Variables 


Models 2 and 3 also contain control variables for client characteristics. Model 2 results show 
that SD/MW are more likely for companies with lower earnings quality as measured by higher 
accruals (р < 0.001), and for new public companies (р = 0.005). SD/MW are unexpectedly less 
likely in more complex companies as measured by segments (p < 0.001) and concentration of 
assets in receivables and inventory (p « 0.001). Model 3 shows that MW classification is more 
likely in firms with lower cash flows (р = 0.035), greater complexity as measured by ARINVTA 
(p = 0.047), and in new public companies (p = 0.030). 


Severity Classification with versus without an Associated Misstatement 


Because prior research suggests differences in auditor judgment processes based on objectiv- 
ity of evidence, we investigate the role of existing misstatements in severity classification by 


The Accounting Review May 2011 
American Accounting Association 


Detection and Severity Classifications of Sarbanes-Oxley 849 


estimating Models 2 and 3 in subsamples of ICD with/without an associated misstatement.” Table 
8 compares coefficients on Section 404 process and ICD type variables in Models 2 and 3, across 
subsamples. We focus our discussion on variables with a significant cross-model comparison, and 
those that are significant in both models. In Model 2, all significant differences between sub- 
samples are in the expected direction (i.e., a stronger association in the no-misstatement sub- 
sample). NUM MW. SQRT is significant in both subsamples (p — 0.059 and p « 0.001, respec- 
tively), but the coefficient is statistically stronger in the no-misstatement subsample (p — 0.029). 
Thus, the client's ICFR context plays a larger role in severity classification when a misstatement 
has not occurred. Section 404 process variables are significant only in the no-misstatement sub- 
sample: REPORT. INDEPENDENT (p = 0.002), with the difference across models significant at p 
0.010, IT. INTEGRATION (p = 0.003), and MONTHS, PRIOR AUDITOR (р = 0.004). Differ- 
ences in coefficients for these three variables are significant across models with probabilities of at 
least р < 0.05. No ICD type variable is significantly different across subsamples, but both models 
show.positive coefficients for revenues (misstatement: p = 0.097; no misstatement: p « 0.001) 
and design flaws (misstatement: p — 0.022; no misstatement: p « 0.001), indicating the pervasive 
importance of these ICD types. 

Table 8 also shows results of comparing coefficients of Model 3 between misstatement and 
no-misstatement subsamples. NUM MW. SQRT is again associated with MW classification in the 
no-misstatement subsample (p = 0.004) and its association is statistically greater than when there 
is a misstatement (p = 0.010). The same pattern holds for REPORT. INDEPENDENT (p = 0.017) 
and DETECTED SUBS (р = 0.002}. However, ІС RELIANCE is significant only when there has 
been a misstatement (p « 0.001), and the association is stronger in that subsample (p — 0.010). 
The latter result implies that when a misstatement amount is available, the auditor is less likely to 
judge the implications of the ICD to be material in clients for which the audit engagement was 
performed with the perception that overall controls were good. However, control reliance is not 
associated with MW classification when there is not a misstatement.”® Among ICD type variables, 
revenue ICD are more likely to be MW in both subsamples (p < 0.001 and p = 0.015, respec- 
tively). General ledger ICD are significant only in the no-misstatement subsample (p = 0.001), 
and the test of association between subsamples is significant (p = 0.039). 

Taken together, Table 8 results show that client and auditor process measures and character- 
istics are more likely to influence severity classification when the control has not yet failed to 
prevent/detect a misstatement. This pattern of findings 1s consistent with prior research on the 
impact of imprecise standards (Mayhew et al. 2001; Nelson et al. 2002; DeZoort et al. 2003), and 
implies that when a control failure is not yet documented, auditors use a wider variety of other 
information about the control flaw and its context to make and justify severity decisions. 


Ү, CONCLUSIONS AND LIMITATIONS 
This study contributes to the literature on ICFR by examining the relative effectiveness of 
companies and their auditors in Section 404 ICD detection, the processes used by those parties, 


*5 Due to the difficulty of interpreting interactions in logit, we compare coefficients between subsamples using seemingly 
unrelated estimation in Stata. Our cross-model comparisons are limited in that the presence of some variables prevents 
estimation in certain subsamples (e.g., consultant use in Model 3). 

26 There are several possible reasons for this pattern, which our data cannot distinguish. One is that clients with high 
control reliance might have more effective compensating or monitoring controls, which would mitigate a MW classi- 
fication. This could be due to auditors anchoring on the available misstatement amount (по! on the potential amount of 
future misstatements) and judging that amount to be less severe for a client with good controls. A further reason to 
downplay the severity of an ICD with a detected misstatement for an engagement in which controls have been relied 
upon, is that if a MW is found, the whole audit strategy of reducing substantive testing due to good controls is called 
into question. 
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and factors influencing ICD severity classification. Our data provide the only systematic control- 
level evidence available on the baseline set of detected ICD. Of those, about one quarter were 
remediated prior to year-end, showing progress in improving controls not visible to the public. Of 
all ICD detected, fewer than 4 percent are publicly disclosed as MW, but many companies having 
a "clean" Section 404 report have at least one SD—a control flaw serious enough to report to the 
audit committee. These results demonstrate the importance of looking beyond publicly available 
information to capture the full picture ICFR problems detected under Section 404. 

Prior research does not differentiate ICD detection by auditors and clients (Leone 2007). Our 
data show that auditors detect almost three-fourths of ICD, especially those that are more severe 
and pervasive. Importantly, this low level of client detection occurs when clients are aware that 
auditors will soon follow with their own tests. Our results also reveal company characteristics 
associated with better ICD detection, including use of large audit firm consultants, Section 404(a) 
reporting independent of management, and greater resources (company size). While we do not find 
the mere presence of an internal audit function improves detection, Lin et al. (2011) find that 
characteristics of internal auditors do influence the quality of companies’ Section 404 activities. 

Our results also shed light on auditor detection processes, a controversial issue at the heart of 
the debate on the value of Section 404(b). We find that two-thirds of auditor-detected MW are 
signaled through control tests. Therefore, without auditor control testing, many key flaws in ICFR 
could have been missed (e.g., over 80 percent of ICD in Control Environment and revenues). KS 
present a contrasting perspective, based on Audit Analytics data indicating that Section 404 opin- 
ions of companies with ineffective controls often mention misstatements. KS therefore suggest 
that substantive procedures, enhanced by “tracing back” from misstatements to the underlying 
control failure, could be an effective surrogate for Section 404 testing in identifying most com- 
panies with ineffective controls." There are several reasons underlying the difference in conclu- 
sions between their research and ours. Audit Analytics data are organized at the company level, 
not at the level of the individual MW. Thus, KS’s results reveal that misstatements are associated 
with at least one of a company’s MW, but not which or how many MW caused a misstatement.” 
Our data augment publicly available information by revealing the number of individual control 
flaws detected per company, and how each was detected. Thus, we show the degree to which a 
policy not relying on auditor testing of controls would miss important control flaws. Our data 
show that, for 27 percent of our companies with ineffective ICFR, no MW are associated with a 
misstatement, and in an additional 40 percent, one or more MW are not associated with a mis- 
statement. This implies that many MW would likely be missed if relying solely on detected 
misstatements to identify control deficiencies. Our data additionally show that most SD would not 
be identified through detected misstatements. In sum, we conclude that relying on misstatements 
to identify ICD will result in missing many severe control flaws. Thus, the intended overall 
purpose of SOX Section 404 (i.e., to improve financial reporting) would not be completely ful- 
filled for those companies. 

In addition to the difference in unit of analysis (company or ICD), our study differs from KS 
in the purpose of Section 404. They focus on disclosure of ineffective controls in the auditor's 
opinion as that Section's purpose. This view implies that if disclosure of ineffective controls (i.e., 


27 Our data show a similar proportion of companies with ineffective controls for which at least one MW is associated with 
a misstatement. KS cite a range of 57 to 73 percent based on the aggregated data; in our sample this proportion is 73 
, Percent. 
28 Additionally, reliance on "tracing back" from misstatements to identify related control flews assumes that the yield of 
misstatements detected from substantive tests would be the same without the informatian provided by the preceding 
extensive control tests required under Section 404. This seems unlikely, given that control tests should direct the 
auditor's attention to areas of greater risk. 
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at least one MW) was obtained for a company through an alternative approach, then a purpose of 
Section 404 would be fulfilled for that company. However, the purpose of Section 404 require- 
ments is to identify and correct control deficiencies and improve the reliability of financial report- 
ing (e.g., COSO 2006; PCAOB 2004). Thus, an ineffective controls opinion is a consequence of 
the process of controls assessment, but not its main purpose. In sum, publicly available data are 
limited in supporting a conclusion on the efficiency and effectiveness of the controls assessment 
and testing process in improving financial reporting reliability. For the reasons cited above, our 
data provide a more complete picture of the results of Section 404 activity. 

We also study how auditors classify ICD in the context of imprecise standards. For auditors, 
this situation is analogous to imprecise financial reporting standards, which have been shown to 
decrease auditors’ ability to prevail in booking proposed audit adjustments (Nelson et al. 2002). 
Our results show that ICD associated with an existing misstatement have higher severity classifi- 
cations. It is likely that a current control failure makes future failures easier to visualize, and 
provides more objective evidence to justify severity to clients (e.g., Kennedy et al. 1997). Never- 
theless, our data show that a misstatement is neither necessary nor sufficient for high severity 
classification, prompting our investigation of how auditors make severity judgments when the 
control has not yet failed. For these ICD, results generally imply а stronger association of task 
expertise and process independence, relative to those ICD that have failed in the current period. 
Use of a large accounting firm consultant, lower quality of internal control, better client IT 
integration, and auditor substantive test detection are more strongly associated with severity when 
the objective evidence provided by a current misstatement is unavailable. 

Our investigation of the nature of ICD also contributes to the literature by showing which 
types of entity-level and account-level controls that auditors consider could produce future mis-. 
statements most threatening to financial reporting quality. Within entity types, it is interesting to 
compare our findings on COSO Control Environment and Risk Assessment categories. Relatively | 
few Control Environment ICD are detected, but those are more likely to be classified as MW. In 
contrast, COSO Risk Assessment ICD are infrequently found, and are less likely to be classified as 
severe. While this might imply that companies’ risk assessment controls were of high quality 
during the study period, evidence from the recent financial crisis suggests that many companies 
did not effectively monitor their risks. Further research should examine whether companies and 
auditors have increased their focus on the quality of risk assessment controls in the current 
environment. 

Among specific accounts/cycles, revenue ICD are more likely to be severe, consistent with the 
importance of revenues in equity valuation. We also find that tax ICD are more likely to be 
classified as at least SD. This is consistent with auditors’ concerns that many clients during the 
study period were unable to perform the tax accrual without assistance. Regarding the nature of 
contro] failures, we find that ineffective controls (well designed but not functioning) are most 
frequent. Those ineffective controls, as well as design flaws (comprising missing controls as well 
as controls that do not meet their intended purpose), and controls that have been insufficiently 
tested by clients, are all considered more severe relative to documentation problems. Further 
research should investigate whether the incidence of these more severe types of control flaws has 
declined following several years of ICD remediation and retesting under Section 404. 

Our study complements a behavioral experiment by Earley et al. (2008), which finds that 
clients’ classifications influence auditors’ judgments. Our archival data show that clients’ classifi- 
cations are overwhelmingly lower than those of auditors, implying that auditors often override — 
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clients’ views.” Client under-classification could be caused by differing incentives, uncertainty 
regarding appropriate procedures (due to lack of precise SEC guidance for company managements 
during our study period), and/or lack of knowledge of internal controls. The first explanation 
seems consistent with our finding that clients with a more independent Section 404(a) process are 
less likely to under-classify, but future research should investigate these and other possible causes. 

The generalizability of this study’s findings may be limited by several features of its design. 
First, our conclusions are limited by restricting the sample to smaller accelerated filers that more 
closely apply to non-accelerated filers and foreign companies not currently subject to Section 404 
or similar regulation. Second, engagements in our sample were performed under AS No. 2. Ad- 
ditional guidance on company and auditor processes for Section 404 were subsequently issued 
(SEC 2005; 2006), and AS No. 5 (PCAOB 2007) superseded AS No. 2. Both the revised auditing 
standard and related SEC guidance were directed toward helping companies and auditors achieve 
greater efficiencies but not to “lower the bar” for reporting issues. While the current study’s results 
should have continuing relevance, research in the AS No. 5 environment would provide insight on 
effects of application of that standard. Third, it is difficult to directly compare the proportions of 
MW identified in our sample with public disclosures, as individual control flaws may be aggre- 
gated in those reports. Fourth, sample engagements were chosen by participating firms. While our 
sample represents publicly available data at the company level (as indicated by statistical com- 
parisons with public data reported above), we cannot rule out the possibility that nonrandom 
selection might have influenced our results. 

In sum, this study provides a unique view into processes and outcomes of the first two years 
of Section 404 compliance. In terms of process, our results support the value of auditor involve- 
ment at two stages of the ICFR assessment process (detection and classification), and contribute to 
the understanding of factors associated with client and auditor performance in both stages. Re- 
garding outcomes, we show that even companies with a clean Section 404 auditor's opinion can 
have numerous and/or significant ICD. Having been identified, those ICD may now be corrected, 
thus improving financial reporting quality. Overall, results of this study imply that the recently 
enacted exemption of non-accelerated filers from SOX Section 404(b), which requires auditor 
testing and an audit report on internal control, is likely to undermine the objectives of SOX for 
these entities, resulting in relatively less reliable financial reporting than had 404(b) provisions 
been implemented. Further, risk assessments in subsequent years may be less effective without the 
detailed information on client controls that comes to light when the independent auditor detects 
and classifies control flaws. 
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ABSTRACT: This paper develops a model of financing that jointly determines a firm's 
capital structure, Its voluntary disclosure policy, and its cost of capital. Investors who 
receive securities in return for supplying capital sometimes incur losses when they 
trade their securities with an Informed trader. The firm’s disclosure policy and the struc- 
ture of its securities determine the Information advantage of the informed trader апа, 
hence, the size of investors’ trading losses and the firm's cost of capital. 

We establish a hierarchy of optimal securities and disclosure policies that varies 
with the volatility of the flrm's cash flows. Debt securities are often optimal, with the form 
of debt—risk-free, investment grade, or "junk" —varying with the firm's cash flow vola- 
tility. Though the model predicts a negative association between firms’ cost of capital 
and the extent of Information firms disclose, more expansive voluntary disclosure does 
not cause firms’ cost of capital to decline. Mandatory disclosures alter firms’ voluntary 
disclosures, their capital structure choices, and their cost of capital. 


Keywords: cost of capital; capital structure; voluntary disclosures. 


L INTRODUCTION 
e develop a model that jointly explains a firm's voluntary disclosure policy, its capital 
W structure, and its cost of capital. While not previously recognized in the literature, it is 
intuitive that there should be a relationship between a firm’s capital structure and its 
disclosure policy, because—barring agency problems—managers will choose their firm's disclo- 
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sure policy so as to maximize the market’s perceptions of the expected value of the firm-owners’ 
residual claims. Hence, the form of those residual claims—equivalently, the form of the securities 
that the firm sells to investors—affects what managers maximize. In turn, the firm’s disclosure 
policy and its capital structure choices affect the firm’s cost of capital. 

In the model we study, a firm receives capital by issuing securities that promise future cash 
payouts to investors. The central feature of the model is that the amount investors are willing to 
pay for the securities may be less than the expected present value of the securities’ cash payouts 
because the investors anticipate at the time they buy the securities that they may bear trading 
losses if they have to liquidate the securities “early”—.e., before maturity. These potential trading 
losses arise because the market in which the securities are traded is presumed to be populated by 
some superiorly informed traders. Compensating investors for these expected trading losses 
ex ante constitutes а cost the firm bears in raising capital: it is the source of the "cost of capital" 
in our model. 

The firm's manager can reduce investors’ expected trading losses by reducing the information 
asymmetry between the superiorly informed traders and the market maker who sets the price for 
the securities. This can be accomplished in two related ways. First, since the informed traders’ 
private information about the firm is likely to overlap with the manager’s private information, the 
manager can disclose his private information.’ Second, the manager can have the firm issue 
securities whose value is not “informationally sensitive” (e.g., Sunder 2006), meaning that their 
value does not vary with the informed traders’ private information. For example, were the secu- 
rities risk-free debt, then the "informed" traders—Tregardless of their information—would have по 
information advantage over the market maker in assessing the securities" value. Thus, a firm's 
capital structure and its disclosure policy jointly determine how much information asymmetry 
remains among market participants, and this remaining information asymmetry determines inves- 
tors’ expected trading losses and, hence, the firm's cost of capital. 

We develop a formula, applicable to all securities, that establishes how a firm's cost of capital 
depends on the security's upside potential (the difference between a security's payoff if the firm's: 
realized cash flows assume "high" or "medium" values) and its downside risk (the difference 
between a security's payoff if the firm's realized cash flows assume “medium” or “low” values). 
The formula shows that the contribution of a security's upside potential and downside risk to the 
firm's cost of capital varies with the firm's disclosure policy. Specifically, the contribution of a 
security's upside potential to the firm's cost of capital decreases as the firm discloses more infor- 
mation, and, perhaps surprisingly, the contribution of a security's downside risk to the firm's cost 
of capital increases as the firm discloses more information. 

Our model establishes that debt is always an optimal security and that there is a hierarchy 
among debt securities and optimal disclosure policies that varies with the volatility of the firm's 
cash flows. A firm with very low volatility in its cash flows prefers to raise capital by issuing 
risk-free debt and adopting an "expansive" disclosure policy? As its cash flow volatility increases, 
the firm prefers to use investment-grade debt (that defaults with low probability) combined with 
the continued use of an expansive disclosure policy. Then, as its cash flow volatility increases still 
further, the firm will continue to use investment-grade debt, but it will curtail its disclosures and 
adopt a "limited" disclosure policy. As its cash flows become even more volatile, the firm will 
switch to using "junk" debt (that defaults if anything other than the highest cash flows occur), 


! We conform to the AAA's style guidelines by mixing gender references in the following: specifically, when describing 
or referring to actors in the model, we arbitrarily assigned the gender of the manager and market maker to be male, and 
the gender of the investor and informed trader to be female. 

2 We defer the precise definitions of an “expansive” and a “limited” disclosure policy to the text below. 
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accompanied by limited disclosure. Finally, as its cash flow volatility becomes greater still, we 
demonstrate that it is impossible to finance the firm with any form of security accompanied by any 
form of disclosure. 

While reminiscent of Myers and Majluf’s (1984) famous financial hierarchy (inside financing 
is least expensive; outside financing with debt is more expensive; outside financing with equity is 
most expensive), our hierarchy is distinct from Myers and Majluf’s (1984) hierarchy in two 
fundamental respects. First, our hierarchy combines a firm’s capital structure choice with its 
voluntary disclosure policy, whereas Myers and Majluf (1984) make no reference to firms’ dis- 
closures. Second, our capital structure hierarchy is indexed to the volatility of a firm’s cash flows, 
whereas Myers and Majluf’s (1984) hierarchy is indexed to the relative costliness of alternative 
financing methods, hoiding cash flow volatility fixed. 

We also explore how mandatory disclosure requirements interact with firms’ voluntary dis- 
closure decisions to affect firms’ cost of capital and securities design choices. Some mandatory 
disclosure requirements are shown to inhibit firms from making voluntary supplementary disclo- 
sures, and other mandatory disclosure requirements are shown to encourage firms to make volun- 
tary supplementary disclosures. In addition, we show that imposing mandatory disclosure require- 
ments sometimes radically alters what securities firms wish to offer their investors. In particular, 
we show that it is sometimes optimal for the firm to raise capital by issuing equity rather than debt. 


Related Literature 


While links between a firm’s disclosure policy and its cost of capital have been established in 
prior accounting research (e.g., Botosan 1997; Botosan and Plumlee 2002), and links between a 
firm's capital structure and its cost of capital separately have been identified through research in 
finance (e.g., in Myers and Majluf 1984), we believe this is the first study in accounting or finance 
that endogenously connects a firm's disclosure policy to its capital structure, and a fortiori, the 
first that connects all three of these components of a firm's financial structure. 

Other models in accounting and finance have emphasized how information asymmetry affects 
firms’ decisions to raise capital. For example, Korajczyk et al. (1991, 1992) noted that firms tend 
to raise new capital in periods where the information asymmetry between insiders of the firm and 
outside investors is small so as to eliminate the lemons' problem with issuing equity (noted 
originally by Myers and Majluf [1984]). Unlike Korajezyk et al. (1991, 1992), we are (1) con- 
cerned about not when firms raise capital, but rather in what form firms raise capital, and (2) not 
concerned about disclosures before the firm raises capital, but rather about disclosures after the 
firm has raised capital. 

Several recent studies discuss links between the quality of firms' financial statements and their 
cost of capital. For example, Jorgensen and Kirschenheiter (2003) show that firms have incentives 
to disclose information selectively that indicates lower cash flow risk, thus leading to low-risk 
premia conditional on voluntary disclosure. As another example, Christensen et al. (2002) develop 
a theory that explains when and how managers' compensation should be adjusted for the cost of 
capital. 

Our model is also linked to the large literature in finance on securities design. For example, 
see Harris and Raviv (1991) for an early survey of this literature, and Nachman and Noe (1994), 
DeMarzo and Duffie (1999), Demange and Laroque (1995), Rahi (1995), Fulghieri and Lukin 
(2001), Sunder (2006), Bias et al. (2007), Casamatta (2003), and Repullo and Suarez (2004) for 


* But, the present model and Korajczyk et al. (1991, 1992) share the important property that the disclosure of information 
has real effects, which is also true of a substantial part of the disclosure literature (е.2., Kanodia et al. 2000; Stocken 
2000; Pae 2002). 
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more recent work. As far as we are aware, none of the prior research on securities design has 
emphasized the interdependencies among securities design, cost of capital, and voluntary disclo- 
sures, which is our focus. 


П. MODEL SETUP 

The timeline for the model consists of four steps, presented in Figure 1. At step t = 1, an 
individual (called the *managez") owns a production technology (called the “firm”) that converts 
capital invested in the first step stochastically into cash flows 0 in the fourth step. The manager, 
who has no capital, offers (one unit of) a security Џ to an investor in return for the investor 
supplying capital to the firm. 

At step г = 2, with probability p є (0,1) the manager privately learns the realization of the 
firm's cash flows, and with probability 1 — p, the manager learns nothing about those cash flows. 
If the manager receives no information, then he necessarily makes no disclosure. But, if he does 
receive information, then he may either disclose that information or disclose nothing, based on 
whether the disclosure or nondisclosure increases the capital market's perceptions of the expected 
value of the manager's residual claim 0-4Д 6).4 We assume all disclosures are truthful, but the 
manager cannot credibly disclose that he has received no information. We let 4(0) = Ø indicate 
that the manager makes no disclosure and 4(0) = 0 indicate that the manager discloses that the 
cash flows are 6. 

At step t — 3, a market for the security " opens.? With probability u the investor experiences 
a liquidity shock that forces her to sell her entire interest in security y on this market. We let 
а = 1 (q = 0) indicate that the investor is (is not) subject to a shock, so Pr(ĝ= 1) =v.° If not 
subject to a shock, though the investor may still sell her security, it will be clear in what follows 
that she will elect not to. The market clearing price of security ij is then established via a Kyle 
(1985)-type market maker, who sets the security's market price based on knowledge of the aggre- 





FIGURE 1 
Timeline 
t=] t= 2 153 154 
Se ne ЧА › 
Manager designs Nature draws Mansger can Investor sells with A market maker Cash flows are 
a security and flow make truthful probabllity у for observes d 
sells it to investor realization 0, public disclosure liquidity reasons. information and The current holders of 
in exchange for observed by the d and soils his An informed aggregate order flow, the claims receive 
the supply of manager with residual claim trader can gubmita posting а price for the appropriate contingent 
cap probability p. on a market. market order. 


We take the assumption that the manager seeks to maximize the expected value of his residual claim as exogenous. In 
an earlier working paper, we endogenize the objective function, The working paper version also extends the technology 
to two required inputs, capital and “effort” supplied by the manager. 

In a working paper version, we had two securities markets open, one for the security # and one for the manager’s 
residual security 0-ф. The results obtained by adding this second securities market are the same аз in the present 
model. 

Our results do not depend on the assumption of an asymmetry in the frequency of the investor's and manager's liquidity 
shocks. What is critical is not how frequently the manager faces а liquidity shock vis-à-vis the investor, but rather that, 
whether subject to a liquidity shock, the manager decides whether to disclose his private information (when he receives 
it) based on whether the disclosure increases his firm's expected selling price. 
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gate order flow in the security. This aggregate order flow potentially includes orders from an 


informed trader. While the informed trader may go “long” in security ij, we presume she is 
prohibited from “shorting” the security.’ 

Formally, we represent the informed trader as one who privately knows, when the manager 
makes no disclosure, whether the reason for the manager’s nondisclosure was due to the manages 
not having received information or having deliberately withheld information he did receive. * In 
the latter case—where the investor knows that the manager withheld information—we presume 
that the trader does not know the value of the information the manager received. This constitutes 
the source of the informed trader's informational advantage over the market maker, when the 
manager makes no disclosure, the market maker, unlike the informed trader, does not know the 
reason for the manager's nondisclosure. This formal representation is intended to depict one of 
many possible ways an informed trader knows less than the manager. Results similar to those we 
obtain would hold for other representations where the informed trader's information is a coarsi- 
fication of the manager's information. 

At step t = 4, the firm's cash flows occur, and the holders of the security at this time are paid 
off according to the terms specified by the security.” 

Having completed the description of the timeline for the model, we conclude this introduction 
by offering more specifics about both the firm’s stochastic production technology and the security 
the manager offers investors. The production technology requires a fixed amount of capital, which 
we scale to be $1, to be supplied by an investor. Less than $1 of capital generates no cash flows 
in step 4 at all, and more than $1 does not increase step 4’s cash flows. For notational convenience, 
we normalize the investor’s required rate of return on capital to 0. Given these normalizations, the 
manager can induce the investor to supply sufficient capital to run the technology by ensuring that 


the security ф has an expected payoff to the investor of at least $1. 

With the $1 capital input, the firm’s cash flows (in step 4) are given by the realization of the 
random variable 6, which assumes one of three possible values: 0; = и – е, or 0, = ш or 6, 
= u + e, for some e > 0. Each of 6, and 6, occurs with probability 2 є (0, 1), апа 6,, occurs with 
probability 1 — 7. We further assume that н — e > б and д > 1. In short, the firm's realized cash 
flows are symmetrically distributed around their mean и; the cash flows are always positive, and 
investing in the production technology is efficient. 

The security й yields a payoff that is potentially contingent on the firm’s realized cash flows. 
Given that the firm’s cash flows can take on three values, a security ф can be described completely 
via the triple (д, Ши» Wa), with 4540). The securities we study include debt, equity, hybrids 
that mix debt and equity, and, more generally, any security that satisfies: (1) limited liability for 
the manager (# 6); (2) limited liability for the investor (0 = i); and (3) monotonicity in the 


We justify this assumption by noting that there are obvious and well-known restrictions and frictions that make 
short-selling difficult, and these restrictions have become even more onerous with the SEC’s September 2008 actions 
(SEC's “Financial Firm Emergency Order” No. 34-58592; Sept. 18, 2008). Even if we allowed for shorting, the 
informed trader could make no extra profit by shorting, because any decision to short would reveal her information to 
the market maker. 

Dye (1998) uses a similar information representation to model privately informed investors. 

The key feature of the information structure is that it is “tiered,” with the manager always weakly better informed than 
the informed trader, and the informed trader always weakly better informed than the market maker. While we have 
assumed that—when the informed trader knows that the manager is withholding information—the informed trader does 
not do anything about what information the manager is withholding, we could have assumed that the informed trader 
imperfectly knows what the manager knows. We expect that our main results would continue to hold in the latter case, 
with few changes. We know, for example, that our results are valid with no changes whatsoever if we assume the 
informed trader learns that, when the manager has withheld information, the informed trader learns into which element 
of the partition {{ 9,},{9„, 9} the manager's private information falls. 


o o 
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sense that both (6) and 0 — X6) are weakly increasing in 0, with 6,— Ө) < 0, 40). In the 
following, we collectively refer to (1)-(3) as the "limited liability end monotonicity assumptions.” | 
Assumptions similar to these natural, standard assumptions have been adopted in many studies of 
securities’ design. 


Preliminary Analysis 

We start the analysis by observing that there are only three possible equilibrium levels for the 
volume, X, of the aggregate order flow for the security. This follows from the assumption that, 
when the investor experiences a liquidity shock, the investor must sell 100 percent of her security. 
Since the informed trader has an interest in masking the identity of her trades, either she does not 
participate in the securities market, or else she submits a market order for exactly the same 
quantity of the security the investor sells (which we normalize tc 1).!' Hence, X's only possible 
values are —1 (if the investor experiences a liquidity shock and the trader stays out of the market), 
0 (if either (a) the investor experiences a liquidity shock and the trader trades or (Б) neither the 
investor experiences a liquidity shock nor does the trader trade), or +1 (if the investor does not 
experience a liquidity shock and the trader trades). 

Next, we consider how the security y is priced by the market maker. Upon observing the 
manager’s disclosure (if any) and the aggregate order flow X, the market maker posts a price 
Р(4,Х) for the security, such that P(d , X) = ЕІ 4, X). This price is what the investor receives by 
selling her security. On average, an investor who faces a liquidity shock must anticipate having to 
sell her security for less than its expected value because of the presence of the informed trader. 2 
So, the ex ante value of the security to the investor: 


Кф=а) = vE(P(d,X)| -11-0-0)Е(Й, (1) 


typically will be below the unconditional expected value ЕС) of the security. The difference 
between E( 7 апа Kọ, d) is the firm’s “cost of capital” (CC) in this model: 


СС(ф,4) = Е(Й”-1(6,4). 0) 


The Downside Risk and Upside Potential of Securities 


In the following analysis, it is often desirable to characterize a security in terms of how the 
value of the security changes with the firm’s realized cash flows, as in the following definition. 
Definition 1: (a) The “downside risk” of security (Л, фи, Чл) is given by Ap = y, — д; 
(b) the "upside potential” of security (W, Ym» Фо) is given Бу Ay= V, фи. 
The downside risk of a security is the amount by which the раус of the security drops as а 
consequence of having the low state 0, occur, rather than the medium state, and the upside 
potential of a security is the amount by which the payoff of the security increases as a conse- 
quence of having the high state 9, occur rather than the medium state 6,,. In view of the relation- 
ship i4, = ij Ap and фе у + р Ар, a security with payoffs (Ш, Ym, Wn) can be described 


10 See, e.g., Harris and Raviv (1990), Innes (1990), Nachman and Noe (1994), DeMarzo and Duffie (1999), 

Н Were the informed trader to submit market orders in quantities different from that of the investor, she would reveal her 
trades and thereby eliminate her information advantage over the market maker. 

12 This fact establishes, as was asserted earlier, that were the investor not subject to a liquidity shock, the investor would 
not participate in this market. Instead, she would wait until the firm's cash flows realizes its value and collect the 
cash-flow-contingent payment as specified by the security. 
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equivalently via the triple (у, Ap, Ay). It is straightforward to show that, in terms of this triple, 
the limited liability and monotonicity assumptions are equivalent to the following three condi- 
tions: (1) 0 € A, Se for i = D,U, (2) A; = 0 for at least опе i = D,U, (3) 0 = js O=p-e. 
Debt securities, which take the form = min(F , 6} for some face value F, play an important 
role in the following. Definition 2 classifies debt instruments in terms of the triple (Ш, Ap, Ay). 


Definition 2: A security (y, Ap, Ay) is “risk-free debt" when it is of the form (4,0,0); 
it is “investment-grade debt" when it is of the form (u - e, Ар, 0) where 
Ap 0; and it is “speculative (or junk) debt" when it is of the form 
(р — e, e, Ду) where Ay > 0. 


The classification of debt securities makes clear that the basic difference among different types of 
debt securities is determined by the states in which default occurs (or, alternatively, the probability 
of default): a firm never defaults on risk-less debt, it defaults on investment-grade debt only if the 
low state occurs, and it defaults on speculative debt if anything other than the high state occurs. 

Equity securities take the form (i, Ар, Ay) = (al u- e) , ае, ae) for some а є (0,1); hybrid 
securities are securities that are neither debt nor equity. Below we show that, when all disclosures 
are voluntary, both equity securities and hybrid securities are dominated by debt securities, and 
that equity securities may be optimal when some disclosures are mandatory. 


Capital Structure-Dependent Disclosure Policies 
Lemma 1 characterizes how the manager's preferred disclosure policy varies with the firm's 
capital structure/security design choice ý. 


Lemma 1: Suppose the manager offers the investor a security (Ш, Ар, Ay) that satisfies the 
limited liability and monotonicity conditions. Then, the possible disclosure 
policies 4(-) the security can induce the manager to adopt are the following: 
d(6,) = 6, d(6) = ©, and: 


if ре < Ap- (1-р)Ду, then d(6,,) = Ø; (3) 


if ре > Ар-(1-р)Ау, then d(6,,) = б„; (4) 


and if each strict inequality in (3) and (4) is replaced by an equality, then the 
security can induce the manager either to disclose, or to not disclose, й= би. 


Lemma 1 shows that any security that satisfies the monotonicity and limited liability condi- 
tions always induces the manager to disclose 0, and never induces him to disclose 0), but whether 
the security induces the manager to disclose 0,, depends upon how large the security's upside 
potential is as compared to its downside risk. The proposition thus formalizes one of the claims 
made in Section Ш: a firm's capital structure—that is, the securities it uses to raise capital from 
investors—affects the firm's disclosure policy. 

The intuition for the proposition is as follows. Consider downside risk first and recall that the 
manager's disclosure decision is based on whether the disclosure increases the market's percep- 
tions of the value of his residual claim 0 — 40). It follows that, when the value of his residual 
claim rises substantially due to having the medium state rather than the low state occur, the 
manager wants the capital market to know it. Ergo, as the security's downside risk decreases, the 
manager is more likely to prefer disclosing 0,. A similar analysis shows that the manager’s 
preference for disclosing 0, also depends on the upside potential of the security. When the 


inequality in (3) and (4) holds, we say that security y induces the manager to adopt a limited 


(expansive) disclosure policy and also that d(-) is the disclosure policy induced by w. 
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It is also worth noting that Lemma 1 is consistent with, and extends, some classic comparative 
statics results of voluntary disclosure. For example, the proposition implies that, as e decreases— 
ie. as the volatility of the firm's cash flow decreases (in the sense of second-order stochastic 
dominance)—then a firm is less likely to engage in expansive disclosure. This implication is 
consistent with Jung and Kwon’s (1988) comparative static that, as a firm's cash flow volatility 
decreases, the threshold above which it discloses information (and below which it withholds 
information) increases. Also, by rewriting Lemma 1 so as to emphasize how the switch from 
limited to expansive disclosure depends on the probability p that the firm receives information, we 
see that there is a threshold probability р" = E such that the manager engages in expansive 
(limited) disclosure when p > p* (p < р). This is consistent with Dye's (1985) and Jung and 
Kwon's (1988) result that the disclosure threshold declines—and firms are more likely to disclose 
their private information—as the probability they receive information increases. 





Informed Trader's Optimal Trading Strategy 

When the manager discloses the information he received, the market maker is at informational 
parity with the informed trader. Hence, the trader does not trade, since she knows there is no 
opportunity to earn positive expected trading profits.? But, recall that, when the manager discloses 
nothing, the trader, unlike the market maker, knows whether the nondisclosure is due to the 
manager not receiving information or due to the manager withholding information. The insider 
will recognize that the market maker is likely to underprice (overprice) the security when the 
insider knows the reason the manager made no disclosure is because the manager did not receive 
(withheld) information, and this likely underpricing (overpricing) will trigger the insider to trade 
(stay out of the market). This provides tbe intuition for Lemma 2. 


Lemma 2: The informed trader's optimal trading strategy is to submit a market order to buy 
security ў when the manager makes no disclosure and the trader knows that the 
reason the manager made no disclosure was because the manager did not receive 
information; in all other circumstances, the trader does not trade. ^ 


Determinants of a Firm's Cost of Capital 

In this section, we provide a detailed analysis of how a firm's cost of capital depends on the 
design of its securities and the firm's disclosure policy. 

For a risk-free security, that is, a security whose payout (6) is constant across all realizations 
of 6, the cost of capital is 0 because the market maker never misprices such a security. Even a 
risky security—i.e., one whose payout (6) varies with 0-48 correctly priced by the market maker 
in three situations: (1) the manager discloses the firm's realized cash flows; (2) the manager makes 
no disclosure and the aggregate market order for security ф is Х = 1 (because X = 1 occurs only 
when the trader places a market order and the investor did not experience a liquidity shock, which 
reveals that the trader knew the manager’s nondisclosure was due to the manager not receiving 
information); or (3) the manager makes no disclosure and the aggregate market order for security 


ф із Х = —1 (because X = —1 occurs only when the investor, and not the trader, places a market 
order, which reveals that the trader knew the manager’s nondisclosure was due to the manager 
withholding information). 

What creates trading losses for an investor who experiences a liquidity shock is the situation 


В We assume that traders who make 0 expected trading profits do not trade. This assumption is justified by any, even 
trivial, transaction costs from trading. 
14 The proof of Lemma 2 is straightforward and omitted. 
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in which the manager makes no disclosure and X = 0. From the market maker’s perspective, this 
last event is consistent with either: (4) neither the investor nor the informed trader places a market 
order for security ф, ог (5) both the investor (because she faced a liquidity shock) and the market 
maker (because he knew that the manager’s nondisclosure was due to the manager not having 
received information) place a market order for security $. In this case, the trader is better informed 
than the market maker, giving rise to the investor's trading losses. By determining how probable 
this situation is, and tlie size of the investor's expected trading losses when it occurs, we can 
calculate the firm's cost of capital. The next proposition displays the results of this calculation, for 
any security and any disclosure policy. (In stating Proposition 1, we write d = 1 for an expansive 
disclosure policy (in place of 4(6,) = 6,) and а = 0 for a limited disclosure policy (in place of 
d(8,) = 2). 


Proposition 1: The firm's cost of capital when it issues security = (ijj, Ар, Ay) and adopts 
disclosure policy d is given by: 


сс(фа) = AT, X Ay * A2, X Ap, (5) 


where 


p(l- ТЕ +(1- 20-2) 
Г eeu 
(1 - p)v* p(1- ЈЕ + (1– 40-9) 


pl- ТЕ +а1-®) 
о 
ü- pv«pü - (2 «a-4d- 5] 


Before discussing Proposition 1, we introduce Corollary 1, which helps to illustrate some of the 
economic consequences of the proposition. 


Corollary 1: 


(a) Пи oe 2; and 

(b Ab Ab СА+А < 2 

Corollary i shows that the firm's cost of capital varies with each of: (1) the security's 
downside risk Ap and its upside potential Ay; (2) the specifics of the information and trading 
environments in which the securities are traded (ie., the probability that the manager receives 
information p and the probability that the investor is subject to a liquidity shock 1); (3) the 
technology that produces the firm's cash flows (i.e., the probability distribution of 6, as summa- 
rized by the parameter 7); and (4) the manager's disclosure policy d. 

The ordering of the coefficients Aj, < АЎ, shows that the upside potential of a security con- 
tributes less to the firm's cost of capital for an expansive disclosure policy than for a limited 
disclosure policy. Stated differently, more disclosure favors securities with higher potential. By 
contrast, the result Aj, > Аб in Corollary 1 shows that the downside risk of a security contributes 
more to the firm’s cost of capital when the firm’s disclosure policy is expansive (d = 1) than when 
the firm’s disclosure policy is limited (d = 0). Stated alternatively, less disclosure favors securities 
with higher downside risk. 
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The intuition for the result is that the firm’s cost of capital is the same as the investor’s 
expected trading losses. The investor bears trading losses only when she sells her security follow- 
ing a liquidity shock and the informed trader buys the security. Note that the probability these two 
events both occur is independent of whether the manager adopts an expansive or limited disclosure 
policy, as this probability is v X (1 — p) in all cases. Hence, the only reason the investor's 
expected trading losses would be higher under one disclosure policy than another is if the equi- 
librium price at which the investor sells her security when the aggregate order flow is 0 is lower 
under one disclosure policy than the other. To isolate the effects of expanded disclosure on a 
security's downside risk, consider a security that has only downside risk (1.е.. Ay=0). Such a 
security provides two distinct possible payoffs to its holder, a “low” payoff when w= y or a 


“high” payoff (which is the same whether фо, ог фе y). Under the expansive disclosure 
policy, the manager only withholds information when the security’s payoff is "low," while under 
the limited disclosure policy, the manager withholds information both when the security’s payoff 
is "low" and also sometimes when the security's payoff is "high" (specifically, when w= Wp). 
Consequently, when the market maker calculates the equilibrium price when d = @ and X = 0 for 
a security with no upside potential, the calculation will result in a lower price when the manager 
adopts the expansive disclosure policy than when he adopts the limited disclosure policy. This 
provides intuition for the ordering A}, > A9. 

The result Al, СА in Corollary 1 shows that the upside potential of a security contributes 
less to the firm’s cost of capital for an expansive disclosure policy than for a limited disclosure 
policy. Stated differently, more disclosure favors securities with higher upside potential. The 
intuition proceeds along lines similar to that in the preceding paragraph. 5 

The results reported in Corollary 1 also show that, if the upside potential and downside risk of 
a security are the same, then the firm’s cost of capital is higher under expansive disclosure than 
under limited disclosure; that is, with A=Ap=Ay, СС(у, 0) = (45 A0) ХА < (А + AD) 
X A2 СС(ф, 1). 

The orderings depicted in the corollary are important to studying both the design of optimal 
securities while holding a firm's disclosure policy fixed, and the optimal choice among disclosure 
policies, as will be seen from the results in Propositions 3, 4, and 5 below. "8 


15 This probability 11 — p) follows from the fact that vis the probability of a liquidity shock, and the informed trader only 
buys when she knows that the reason for the manager not making a disclosure is that the manager did not receive any 
information, which occurs with probability 1 — р. 

16 To isolate the effects of expanded disclosure on a security's upside potential, consider a security that has only upside 

potential (so Ap = 0). Such a security provides two distinct possible payoffs to its holder, a "low" payoff (which is the 

same whether фа yy or ф= фу) ог a "high" payoff when фе Yp. Under the limited disclosure policy, the manager 
always withholds information when the security’s payoff is “low” (i.e., regardless of whether iss фу or фа фо), while 
under the expansive disclosure policy, the manager sometimes discloses information even though the security’s payoff 
is “low” (specifically, when ф= у). Consequently, when the market maker calculates the equilibrium price when d = 

Ф and X = 0 for a security with no downside risk, the calculation will result in a lower price when the manager adopts 

the limited disclosure policy than when he adopts the expansive disclosure policy. This provides intuition for the 

ordering А! < AD. 

When the aggregate order flow is 0 and the manager makes no disclosure, under expansive disclosure the security’s 

equilibrium price depends оп the manager withholding information only when the security's payoff is у, whereas under 

the limited disclosure policy the manager withholds information both when the security's payoff is v, and when its 

payoff is фи. Since, for a security with equal downside risk and upside potential, v, = ЕЙ > d; (as: ЕГ] = V, for a 

security with equal downside risk and upside potential), it follows that the firm's equilibrium price will be lower when 

d = © and X = 0 under the expansive disclosure policy than under the limited disclosure policy, which explains why 

СС(Ё,0) < СС(ф, 1) in that case. This provides intuition for the ordering АО + A9. < Ab + А. 

There are many additional comparative statics that can be derived from Proposition 1. To save space, these comparative 

statics are not presented here, but they are available from the authors. 
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Ш. OPTIMAL SECURITY DESIGN AND DISCLOSURE POLICY 

In this section, holding fixed the manager's expansive or limited disclosure policy, we identify 
the manager’s most preferred security among all securities that satisfy the limited liability and 
monotonicity conditions. 

It is intuitive, and it is possible to show formally, that an optimal security minimizes the firm’s 
cost of capital, ASAp +AgAy, subject to a suitable set of constraints. When the security is de- 
signed to induce the manager to implement disclosure policy d {0,1}, the program that identifies 
the optimal security is the following: 

Optimal Security Design d (OSDd) Program: 


min АЗАр+А Ау (6) 
Фарбу 
subject to: ` 

(1 - 2d)pe = (1 - 2Ф(Др- (1 - p) Ay); (7) 

1 ж 1= Ер АА-АА; (8) 

OS h p-e; and (9) 

OSA Se, іє). (10) 


The OSDd Program incorporates several results derived above: constraint (7) requires that the 
manager’s adopted disclosure policy d be incentive compatible with the manager’s security design 
choice (Lemma 1); constraint (8) requires that the investor be willing to purchase the security 
(Lemma 2); and constraints (9) and (10) reflect the monotonicity and limited liability constraints. 
A security is said to be feasible if it satisfies all these constraints. 

Clearly, a lower bound on the firm’s cost of capital is 0. When this bound can be obtained, we 
say that it is “first-best.” The first-best situation arises if and only if Ap = Ay=0, i.e., the security 
issued is risk-free debt. The maximum amount of capital the manager can raise by issuing a 
risk-free security is р — e. Hence, first-best financing is achieved if and only if the required capital 
to implement the investment (= 1) is less than p — є, 1.е., if and only if the volatility e in the firm's 
cash flows does not exceed u — 1. 

The next proposition reports that the risk-free debt security achieves first-best and induces the 
manager to adopt the expansive disclosure policy. 


Proposition 2: There is a security that implements the expansive disclosure policy that yields 
the first-best outcome if and only if e < р — 1. That security must take the | 
form of risk-free debt with face value F є [1р — e]; it always induces 
expansive disclosure. 


Proposition 2 provides the first illustration of the sensitivity of the structure of optimal secu- 
rities to the size of the firm's cash flow volatility, as measured by the parameter e. This sensitivity 
will be shown to be a central feature of the analysis that follows. Since Proposition 2 establishes 
the optimal security for e < р — 1, the following analysis focuses on the case of e> u — 1. 


Optimal Securities that Implement Limited Disclosure 

This section studies optimal securities that implement the limited disclosure policy. The main 
result is to establish the existence of a hierarchy of optimal securities, where what constitutes an 
optimal security varies with the volatility parameter e. 
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To proceed, we observe that we can set y= u ~ e in Program OSDO with no loss in profits to 
the manager. With this substitution, it is clear that Program OSDO is a linear program in the two 
variables Ар and Ay that can be solved geometrically. To obtain the optimal security, all that has 
to be done is to search for the lowest iso-cost line for the objective function, i.e., the lowest value 
k among the lines АОЛ + А Ду =k that lies within this constraint set. Figure 2 illustrates the 
linear program that yields the optimal security that implements limited disclosure. 

The following proposition summarizes the findings from the geometric investigation of the 
OSDO Program in the Appendix. 


Proposition 3: The optimal security that implements the limited disclosure policy is: 
(1) investment-grade debt with face value: 


1-1 





и-(1-ре КЊа-1<е5——————— e ey 
1-(1-2-43] 
2 
(2) investment-grade debt with face value: 
1+е- -1 
pret if e) ex Ё = 20; 
7 7 о 
De A 2 +Ар 
(3) speculative debt with face value: 
(2-491) +1+ (2448), 
2 2 "E ud 520 
— A iio ээг еру and 
Л 49 Ар+Ау 
2 ^U 


(4) the limited disclosure policy cannot be implemented by any feasible security if e = 20. 


FIGURE 2 
Optimal Securities that Implement the Limited Disclosure Policy 
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Proposition 3 establishes that there is a simple two-level hierarchy of optimal securities that 
induce the manager to implement the limited disclosure policy: investment-grade debt is optimal 
when the volatility of the firm’s cash flows is sufficiently low, and speculative debt is optimal 
when the volatility of the firm’s cash flows is high. When the firm’s cash flow volatility is 
extremely high, no feasible security of any kind implements the limited disclosure policy. 

The intuition for the results in Proposition 3 is clear: as noted in the discussion of Proposition 
2 above, risk-free debt never induces the manager to implement the limited disclosure policy. 
Among the risky securities that induce the manager to implement the limited disclosure policy, the 
manager prefers those securities whose payoffs do not vary much with the firm’s cash flows 
because the absence of much volatility in a security’s payoff reduces the informational advantage 
of the informed trader over the market maker, thereby reducing the trader’s expected trading 
profits and driving down the firm’s cost of capital. More than this can be said, however. When 
choosing among those risky securities that induce the manager to adopt the limited disclosure 
policy, the manager prefers securities that have only downside risk. Such securities have a lower 
cost of capital than do securities that also have upside potential, for the reasons described in the 
discussion following Corollary 1. 

When д=и-е, securities with only downside risk are of the form (dq, Ap, Ду) 
=(д-е, Ар, 0); such securities are investment-grade debt. An optimal security that implements 
the limited disclosure policy is, therefore, investment grade debt with downside risk Ap that 
satisfies the disclosure constraint in Equation (7). This explains part (1) of Proposition 3. 

As the volatility in the firm’s cash flows increases, the volatility of the payoff of the optimal 
security must increase as well. The cost of issuing a security with more volatile payoffs is that the 
information advantage of the trader over the market maker goes up, which increases the investor’s 
expected trading losses. In order to compensate the investor for her increased trading losses, the 
manager must issue debt with a higher face value, resulting in a higher cost of capital. This 
explains part (2) of Proposition 3. 

For sufficiently high cash flow volatility levels, even securities with maximal downside risk 
(but no upside potential) are unable to induce the investor to supply financing. To construct 
feasible securities in such cases, the manager must resort to securities that have both downside risk 
and upside potential. Such securities divert all the firm’s cash flows to the security holder, unless 
the most favorable (highest cash flow) state occurs. These securities are speculative debt. This 
explains part (3) of Proposition 3. Financing the project eventually becomes infeasible at very high 
levels of the firm’s cash flow volatility, because the investor’s expected trading losses become so 
high that ‘no feasible security would compensate her adequately for her expected trading losses. 
This explains part (4) of Proposition 3. 


Optimal Securities that Implement Expansive Disclosure 

The program that determines the optimal security that implements the expansive disclosure 
policy is also a linear program in the two variables Ap and Ay, which can be solved geometrically. 
As was the case for the linear program in the preceding section, to obtain the optimal security, all 
that has to be done is to search for the lowest iso-cost line for the objective function, 1.е., the 
lowest value k among the lines АБА, АДА у = КЕ that lies within this constraint set. Figure 3 
illustrates the linear program that yields the optimal security that implements expansive disclosure. 

The following proposition summarizes the findings from this geometric investigation. 


Proposition 4: An optimal security that implements the expansive disclosure policy is: 
(1) risk-free debt with face value: 


РеПуи-е) ifexsp-l; 
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FIGURE 3 
Optimal Securities that Implement the Expansive Disclosure Policy 
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(2) investment-grade debt with face value: 
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Proposition 4 establishes that there is a simple three-level hierarchy of optimal securities that 
induce the manager to implement the expansive disclosure policy: risk-free debt is optimal when 
the volatility of the firm's cash flows is extremely low; investment-grade debt is optimal when the 
volatility of the firm's cash flows is too high to make risk-free debt feasible, but is still sufficiently 
low; and hybrid securities are optimal for higher levels of volatility. As was also true for the 
limited disclosure policy discussed in the previous section, when the firm's cash flow volatility is 
extremely high, no feasible security of any kind implements the expansive disclosure policy. 

The intuition for part (1) was explained in the discussion of Proposition 2. The intuition for 
parts (2) and (3) is somewhat similar to that for Proposition 3: as the volatility in the firm's cash 
flows increases, the volatility of the feasible securities that induce the investor to supply financing 
also increases. In the discussion following Corollary 1, we observed that the investor's trading 
losses depend on the price the market maker sets following no disclosure and an order flow of 0. 
When the manager implements the expansive disclosure policy, the non-disclosure price is a 
weighted average of y (in case the manager strategically withheld information) and Я) = Fh 
+ (1- ту + 244 (in case the manager did not receive information). Hence, the non-disclosure 
price overweights only the low outcome of the security (relative to the prior probability). In order 
to minimize the investor’s trading losses—or, equivalently, minimize the firm's cost of capital— 
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the manager chooses a security that sets the payoff in the low state to its maximum value, yy 
= д. – е, but is indifferent between increasing Ше security’s payoff in the intermediate or high state. 
It follows that debt securities are the optimal securities that induce the expansive disclosure policy 
(to the extent they satisfy the monotonicity and limited liability constraints). To compensate the 
investor for her trading losses as the volatility in the firm’s cash flows increases, the manager must 
offer a higher face value on its debt instrument. 

However, doing so also reduces the manager’s incentives to implement the expansive disclo- 
sure policy. When the cash flow volatility extends beyond the critical threshold elm no feasible 
investment-grade debt will continue to induce the manager to adopt the expansive disclosure 
policy. For higher volatilities, the manager needs to issue hybrid securities with not only downside 
risk, but also upside potential, in order to induce the manager to disclose the intermediate outcome 
(as required under expansive disclosure policies). Part (4) of Proposition 4 follows for reasons 
similar to part (5) of Proposition 3: at very high levels of cash flow volatility, the manager cannot 
finance the project. 


Global Hierarchy of Securities and Disclosure Policies 


So far, we have established the existence of two hierarchies of securities, one for securities 
that optimally implement the limited disclosure policy and one for securities that optimally imple- 
ment the expansive disclosure policy. We next develop a combined hierarchy that involves opti- 
mizing across both securities and disclosure policies. We call the optimal security and disclosure 
policy that solves this joint optimization problem the “globally optimal” security and disclosure 
policy. 

By relying on the results already established, we can get an immediate sense of what consti- 
tutes the globally optimal security and disclosure policy in the “corner” case in which the firm’s 
cash flow volatility is sufficiently low (i.e., e < u — 1). For that case, Proposition 2 establishes that 
the expansive disclosure policy necessarily dominates because risk-free securities induce the ex- 
pansive disclosure policy but not the limited disclosure policy. Appealing to the continuity of our 
optimization program, the expansive disclosure policy is also the globally optimal disclosure 
policy for cash flow volatilities e close to but higher than ц — 1. In that case, the associated 
globally optimal security is investment-grade debt (by Proposition 4, part 2). 

We can also get a sense for what constitutes the globally optimal security and disclosure 
policy in the other “corner” case in which the firm's cash flow volatility is sufficiently high. When 
the firm's cash flow volatility is sufficiently high, the only potentially feasible security is one that 
transfers nearly all of the firm's cash flows to the investor—i.e., one for which both the downside 
risk and upside potential of the security are nearly maximal: Ap == e => Ay. Given this observation, 
then the result in Corollary 1 that А? + А0, < А}, + AL leads immediately to the conclusion that the 
globally optimal disclosure policy must be the limited disclosure policy, as that result implies that 
а firm's cost of capital under the limited disclosure policy, АОЛ + AA, = (А + Абје, will be 
less than its cost of capital under the expansive disclosure policy, АБАр + AyAy = (Ap + Ap)e. 
Appealing to continuity a second time, we can further conclude that the limited disclosure policy 
is the globally optimal disclosure policy for all cash flow volatilities close to but lower than the 
maximum volatility for which the project can be implemented. The associated globally optimal 
security is speculative debt (by Proposition 3). 

Having settled the question of what constitutes the globally optimal security and disclosure 
policy when the firm's cash flow volatility falls in the "corners," it remains to determine the 
globally optimal security and disclosure policy when the cash flow volatility assumes “interme- 
diate" values. The next proposition provides a complete answer to this question. 
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Proposition 5: There exists a unique value, e*, for the volatility of the firm’s cash flows, 
with e* є (ш- 1, el), such that: 


(1) if 0<e<e*, then expansive disclosure combined with either riskless debt (when 
e S y - 1) or investment-grade debt (when e > р — 1), as described in Proposition 4, is 
optimal; 

(2) if еж « e « 20, then limited disclosure combined with either investment-grade debt 
(when e = е0) or speculative debt (when e > е?) as described in Proposition 3, is 
optimal; and 

(3) Не= 2, then the project cannot be implemented by any feasible security. 

Proposition 5 establishes that the manager prefers the expansive disclosure policy in all 
environments characterized by a sufficiently low level of cash flow volatility, whereas he prefers 
the limited disclosure policy in all environments characterized by higher levels of cash flow 
volatility, up to some maximum cash flow volatility beyond which the project cannot be imple- 
mented by any disclosure policy (either limited or expansive). In addition, Proposition 5 estab- 
lishes that, for all cash flow volatilities for which the manager can implement the project, the 
optimal security is always a debt security. The debt security that is globally optimal is riskless debt 
when the cash flow volatility is sufficiently low, becoming investment-grade debt as the cash flow 
volatility increases, and speculative debt as the cash flow volatility increases still further. 

Another feature of Proposition 5 worth mentioning is that hybrid securities are never optimal. 
Hybrid securities have one function: to induce expansive disclosure for parameter values such that 
standard debt alone would not induce expansive disclosure. Relative to the standard debt security, 
the hybrid security exhibits more upside potential and less downside risk. Therefore, the manager 
trades off limited-disclosure downside risk (when using standard debt) and expansive-disclosure 
upside potential (when using the hybrid). However, following the intuition given in Corollary 1, 
the limited-disclosure downside risk is less costly than the expansive-disclosure upside potential. 
It then follows that using the standard debt instrument is preferable to encouraging expansive 
disclosure using the hybrid security. 

Perhaps the most important prediction that emerges from Proposition 5 is the following 
corollary: 

Corollary 2: In equilibrium, there is a negative association between firms' cost of capital and 

the amount of information voluntarily disclosed by firms. 


This prediction follows from two previously established facts. First, as a firm's cash flow 
volatility e increases, its maximum expected profits decline due to the firm's cost of capital 
increasing (this follows by combining Propositions 3, 4, and 5). Second, as a firm's cash flow 
volatility increases, it optimally switches from the expansive disclosure policy to the limited 
disclosure policy (by Proposition 5). 

This negative correlation does not imply, however, that more expansive voluntary disclosure 
causes firms’ cost of capital to decline. On the contrary, the model predicts that, whenever the firm 
issues risky debt, more disclosure leads to a strict increase in the firm's cost of capital. To see this, 
first consider investment-grade debt. The cost of capital associated with issuing investment-grade 
debt, (i — e, Ap, 0), is ATA. Expansive disclosure yields a greater cost of capital for investment- 
grade debt than does limited disclosure because, by Corollary 1, Аб < A}. Next consider specu- 
lative debt. The cost of capital associated with issuing speculative debt and adopting the limited 
disclosure policy is Абе +АЗАу. This is always less than the cost of capital associated with 


19 The proof of the proposition establishes that е1, < е0. So, since е0, < е0, by Proposition 3, each of the intervals 
(e* , elp] and (eO, , 265] is nonempty. 
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issuing speculative debt and adopting the expansive disclosure policy, Abe + АД у because the 
downside risk, Ap, is higher than the upside potential, Ay. Thus, even though for any given risky 
debt security, there is a positive association between a firm’s cost of capital and how much it 
discloses, when the firm chooses its securities optimally, the equilibrium (hence, predicted) asso- 
ciation between a firm’s cost of capital and how much the firm discloses is negative. 


IV. INTRODUCING MANDATORY DISCLOSURES 
In this section, we extend the model by considering how mandatory disclosure requirements 
affect the manager’s choice among securities, the manager’s propensity to make supplementary 
voluntary disclosures, and the firm’s cost of capital. We consider two distinct forms of mandatory 
disclosure requirements. One form, which we label “loss recognition,” requires that the manager 
disclose the low outcome (6,) when he knows it. The other form, which we label “gains recogni- 
tion,” requires the manager to disclose the medium outcome (6,,) when he knows it. 


Optimal Security Design under Mandatory Loss Recognition 

The next proposition describes the manager’s preferred voluntary disclosure policy under 
mandatory loss recognition. Under mandatory loss recognition, the manager always discloses both 
the low and the high outcomes. Whether he discloses the medium outcome depends on the relative 
magnitude of the investor’s downside risk and upside potential, as the following proposition 
indicates: 

Proposition 6: Under mandatory loss recognition disclosures, if the manager offers the in- 
vestor a security (i, Ap , Ay) that satisfies the limited liability and monoto- 
nicity conditions, then the possible voluntary disclosure policies d(-) the se- 
curity can induce the manager to adopt are the following: d(6,) = 0,, 4(6,) 


= Ө, апа: 
if Ap > Ар, then 4(6,) = Ø; (11) 
if Ap < Ар, then 4(0,) = да; (12) 


and if each strict inequality in (11) and (12) is replaced by an equality, then 
the security can induce the manager either to disclosure, or not to disclose, 
O= 6,. 


The intuition for this result is as follows. Whether the manager discloses the medium outcome 
depends on how the value of his residual claim compares when he discloses 6, versus when ће 
does not make a disclosure. Since the manager discloses both the low and high outcome under 
mandatory loss recognition, the value of his residual claim given the medium outcome is higher 
(lower) when the manager is exposed to more (less) downside risk than upside potential. Equiva- 
lently, the manager chooses to disclose (withhold) the medium outcome when the investor cap- 
tures less (more) of the downside risk than the upside potential, i.e., Ар < Ap (Ap > Ay). 

Juxtaposing this disclosure policy with Lemma 1 provides a dramatic illustration of how 
mandatory disclosures can affect a firm's voluntary disclosure policy. Because the inequality (3) 
characterizing nondisclosure in Lemma 1 can be written alternatively as Ду, < PE and since, for 
all Ap e [0, e), it is clear that 2272 < Ap, it follows that any security (ij, Ар, Ay) that induces 
the manager not to disclose 0, when there are no disclosure requirements will also induce the 
manager not to disclose 0, when the mandatory loss recognition disclosure requirements are 
imposed. Thus, in the case of mandatory loss recognition disclosures, holding the security the firm 
issues fixed, mandatory disclosure inhibits voluntary disclosure. We show in Proposition 7 below, 
however, that if the manager is allowed to optimize his choice among securities (when the dis- 
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closure regime changes from all disclosures being voluntary to the mandatory loss recognition 
regime), then mandatory disclosure leads to expanded voluntary disclosure. This conclusion thus 
provides yet another illustration of the importance of endogenizing a firm’s capital structure when 
making predictions about disclosures: the predictions about disclosures that emerge from holding 
capital structure fixed are the opposite of the predictions that emerge when the firm’s capital 
structure is endogenized. 

When Ap < Ay, the manager engages in full disclosure of his private information and the 
market maker and the investor are at informational parity with the informed trader. Accordingly, 
the security will always be correctly priced, the investor will not incur any expected trading losses, 
the firm’s cost of capital will be 0, and the first-best outcome will be achieved. The next propo- 
sition shows that the manager can exploit this result and achieve first-best by issuing an equity 
security. 


Proposition 7: Under mandatory loss recognition disclosures, for all e є (0,2), the equity 


security феї (i.e., (h, Ap, Ap) e 17 5,5, 5) induces the manager to 
engage in full disclosure and achieves the first-best outcome.” 

Notice that no risky debt security will induce the manager to fully disclose his private infor- 
mation, because no risky debt security satisfies inequality (12). Thus, no risky debt security 
induces the manager to engage in a policy of full disclosure; hence, no risky debt security com- 
pletely eliminates the informed trader’s information advantage over the market maker and yields 
the first-best outcome. 

We conclude this subsection with two further comments concerning Proposition 7. First, the 
proposition illustrates that what are considered to be optimal securities can change with the 
introduction of mandatory disclosure requirements, because equity securities are typically not 
optimal when all disclosures are voluntary. Second, the proposition illustrates how mandatory 
disclosures can lead to an increase in the manager’s maximum expected profits. We shall see in the 
next subsection, however, that mandatory disclosures do not always increase the manager’s maxi- 
mum expected profits. 


Optimal Security Design under Mandatory Gains Recognition 

In this subsection, we discuss the mandatory disclosure regime referred to as “gains recogni- 
tion” above, which entails that the manager must disclose the medium outcome (6,,) when he 
knows it. 

Under gains recognition, the manager will always keep silent when he learns that 6-0, 
occurs—as he cannot possibly benefit by revealing that the firm’s cash flows are the lowest 
possible level. Because the manager will always disclose 0-0, when he learns it, and he neces- 
sarily discloses @= 0, when he learns it to be in conformity with the gains recognition mandatory 
disclosure requirement, it follows that the disclosure policy the manager adopts in the presence of 
mandatory gains recognition is tantamount to the manager implementing the expansive disclosure 
policy. From this observation, it is immediate that mandatory gains recognition disclosures some- 
times reduces the manager’s maximum expected profits relative to the situation in which all 
disclosures are voluntary. These remarks prove: 


Proposition 8: Under mandatory gains recognition disclosures, the disclosure policy the 
manager is induced to adopt is always the expansive disclosure policy, and 


20 The proof of Proposition is immediate: since Ap=Ay, condition (12) holds (weakly) for the equity security 
(th, Ap, Ay) =O arte ©), so the manager (weakly) prefers full disclosure. 
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the manager’s maximum expected profits are always weakly (in fact, strictly 
for sufficiently high levels of cash flow volatility) below what the manager 
could achieve when all disclosures are voluntary. 


Viewing mandatory disclosures as a commitment to a disclosure policy, this result shows that 
commitment to a disclosure policy can reduce the manager’s maximum expected profits. While it 
is commonplace to observe that mandatory disclosure requirements may be undesirable because 
they generate proprietary costs of disclosure, Propositions 7 and 8 combined indicate that unquali- 
fied statements about the desirability or undesirability of expanded mandatory disclosure require- 
ments, or unqualified statements about the desirability of committing to expanded disclosures, 
typically are invalid even absent concerns about disclosing proprietary information. 


У, CONCLUSION 

The owners of a firm use all of the instruments at their disposal, including their operating 
choices, financing choices, investment choices, and disclosure choices, to maximize the expected 
value of their residual stakes in their firm. Because the array of such instruments available to a 
firm’s owners is in practice so large, any model must inevitably put some restrictions on the range 
of endogenous variables the owners of a firm are posited to select. The present model demon- 
strates that, for purposes of studying a firm’s disclosure policy, it is undesirable to take the firm’s 
financing decisions as given, because as the firm’s financing decisions change, the residual stake 
over which the owners of the firm have claim change, and the latter change alters the owners’ 
preferred voluntary disclosure policy. We show that the reverse is also true. Namely, the capital 
structure the owners of a firm prefer also depends on the firm’s disclosure policy. These interde- 
pendencies imply that, in equilibrium, a firm’s capital structure and disclosure policy are jointly 
determined, and together determine the firm’s cost of capital. 


APPENDIX 
Proof of Lemma 1 


In view of the assumed monotonicity of 9 — б), the unique optimality of 4(9,) = 6, and 
4(0) = Ф is clear. Consequently, the proof focuses on the specification of d(6,,). If Equation (3) 
describes the manager’s disclosure policy, then E[ 6— 8) | d] yields: 


Е0- ба = 6,] = 6, ~ V 


and, by Bayes’ rule: 


а -pa, - £3) +p 0 – 76, - m) 
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~ 7 7 ; 
ED] - - пи Ge th) = You LU М — (n #0]. 
Given the restriction to truth-telling, were the manager to disclose d= Om, then necessarily: 


E[6- да = Op] = Omn- Yin 
So, for the disclosure policy in Equation (3) to be an equilibrium, we require: 


ELO- WO|d= 6,] = В 8- ја =D], 


~ 7) ~. 

Й ‚(а - 1) Um Е) + Де EU + и) 
On- УУ - — — — — ——— — — — 
1-р2 

2 
which simplifies to: 
ре = åp- (1 - р)Ду. 
The proof ої Equation (4) is analogous and omitted. 
Proof of Proposition 1 
The investor’s expected payoff is: 
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Pr(d = @,Х =- 1|д =1)+Рг(4= Ø,X 2 0|q = 1) + Pr(d= Ө) + Pr(d = и) =1. 
The price P((2,X) can be written as: 
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Market Maker’s Beliefs if the Manager Implements the Limited Disclosure Policy 

Let Sjrowcash = 0 indicate that the manager learns that cash flows will be 6, and Sinowcash = © 
indicate that the manager receives no information about cash flows. Suppose the manager does not 
disclose the median value and the market maker conjectures that the trader does not buy the 
security if she sees буне = 1 and purchases the security if she sees 5„ильем = 0. Then, the market 
maker's beliefs are as follows. 

If = 1 and d = ©, then the trader must have purchased the security because бре = 0. 
Hence, the market maker beliefs equal the prior beliefs. 


Pr(6,| D, X = 1,4- 0) = Pr( визнане = 0) = 


юз 


Pr(8|, X = 1,4 =0) = Pr( O|Swithheid = 0) = 2 


If X = 0 and d = Ø, then either (1) the trader purchased the security because ими = О and there 
was a liquidity shock (which occurs with probability (1 — p)v), or (2) the trader did not purchase 
the security because зри = 1 (which implies Sprowcash Є (44, 0) and there was no liquidity 
shock (which occurs with probability ра - 2 ~ v)). Hence, the market maker's beliefs are: 


7 
50-р) 
Pr(6,,d = ©,X = 0) 2 2) 
САВА фак) a rt prs, Rt 
Зея ар (1-2)ра -9 2 
халил ^24 oP ee) 
Pr(6,|O,X = 0,4 = 0) = а ——___—_____ » = 
Pr(d = @,Х = 0) 7 
(1-p)u+ Ї = пра -1) 


If X = —1 and d = Ø, then from X = - 1, the market maker can infer that the trader did not 
purchase the security because била = 1. Hence: 


Р (0,6,Х--1,4-0)-0 


7 
2 7 
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Based on these beliefs and: 
Pr(d=@,X=0|q=1,d=0)=(1—p) 
Pr(d = 9 -2Р 
РКа= u) =0 
we can compute: 
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Market Maker’s Beliefs if the Manager Implements the Expansive Disclosure Policy 
Following the same logic as before, we can calculate: 


Л 
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If = —1 and d = Ø, then from X = —1, the market maker can infer that the trader did not 
purchase the security because Syipneg= 1 and Sprowcash = д. Hence: 


Р (9,0,Х--1,4-1)-0 


РГ(0|06,Х--1,4-1)-41. 
Based on these beliefs and: 


Pr(d=,X=~1|q=1,d=1) = Tp 
РЦ(4-0,Х-0|4-1,4:1)-0(1-р) 
Раз 6) = 2n 


Pr(d= и) = (1 Др 


we can compute: 





2070-9 
Ay=ul -Р)> 
7 
(1-p)u+ Pu -1) 
and: 
( - пра -1) 
1 7) 2 
Ap z 41-р) А 
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Proof of Proposition 3 
The Optimal Security Design а = 0 (0500) Program 
min АСА, АД, (13) 

Ар 

subject to: 
pe 5 åp- (1 - р)Ду (14) 
1ee- ns (1-2–аоја (2-22); (15) 
0=4,=е; ie(D,U). (16) 


The feasible set of Ap and Ay is then determined by Equation (15) to be that subset of the square 
[0,е] х [0,e] in (Ар, Ay)-space lying below the line: 
An- 
Ау= 2—2 (17) 
1-р 
and above the investor’s "supply-financing" line: 





Ауе. (18) 


We start by observing that, in ай non-first best cases (1.е., when 1 + e ~ p > 0), the investor’s 
“supply financing" line (18) intersects the positive orthant in (Ар, Ay)-space. The key to deter-- 
mining the smallest value attained by the objective function (13) in the constraint set is the 
observation that the (downward-sloping) iso-cost lines of the objective function (13) are flatter 
than the slope of the line (18), as the following lemma states. 


Lemma 3 

Ар 21-02 -A9 

Ay 72-Ар 
Proof 

Rewrite 

А and А0 as Аб = Ai and A% = | = т, 
where: 

1- 

Д Р ДАН 

(1 –рји+ ( 5 2 ра -4) 


With this notation: 
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4 0 7 
А — 
AD 2 2 
lg NOU" WR MT (19) 
9 Ї De 1-2 
2 2 
1-72-46 0 199328 і 
4 Then it is easy to confirm that the inequality іп the statement of the 


lemma is equivalent to 0 < 1 — 7), which always holds. 

Given this fact, it follows that the optimal security is defined by the “most southwest” point 
of the constraint set (14)-(16), as illustrated in Figure 2. We identify what that “most southwest” 
point is by studying the following four mutually exclusive and exhaustive cases (for e > д — 1). 


• In case 1 (“low volatility"), the point on the line (18) lying on the Ap-axis, i.e., the point 


(Ао, A9 = па lies to the left of (Ap, Ay) = (ре, 0)! This case occurs when 
-2. 


lte- < : -1 
e| ie, when es 
1-2-4} Р | 1-(1-2-45)p 


(Ap, Ay) in the feasible set is (ре,0). This corresponds to the security (ці, Ap, Ay) 
= (ш-е,ре, 0). This security is investment-grade debt with face value р — (1 — рје. 

* In case 2 (“medium volatility"), (До, AD) lies to the right of (Ар, Ay) = (ре, 0) but to the 
left of (Ap, Ay) = (e, 0). This case occurs when e e and 240 < е i.e., when e 


= еб. In this case, the "southwest-most" 


1-2-4, 
2 
<es ret = eor. In this case, the “southwest-most” (Ap, Ay) in the feasible set is 
2+ 
(А, AD). This corresponds to фе security (Jj, Ap, Ay) = [ace ~ 2 : 0) This secu- 
-2. 


rity is investment-grade debt with face value ш — e + d 
-1- 
• Incase 3 (“high volatility”), (А5 , AD) lies to the right of (А, Ay) = (е, 0) and the point 
(Ap, Ay) on the line (18) evaluated at Ap =e lies inside the box [0,e] X [0,e]. This case 
Vee-u-(1-21-4)e 


occurs when e > е0 апі 
ad (2-49) 


=] 2 
же, 80 бр < e < ду = ёр. In this case, the 


Lte~p-{ 1-2-48)е 
R 


> 
2 


| This security is speculative debt 


"southwest-most" (Ар, Ay) in the feasible set is |е, | This corresponds to 


\ 1e-n-(1-2-45)e 
the security (Jj, Ap, Ay) = ЗИЛ EE 
иду) р 
2-а}, | 
• Incase 4 (“extremely high volatility”), (49 ; AS) lies to the right of (Ap, Ay) = (е, 0), but 
the point (Ap, Ay) on the line (18) evaluated at Ap = е lies outside the box [0,e] X [бе]. 
This case occurs when e = zp In this case, no feasible security can implement the limited 
disclosure policy. 
Substituting the values of the downside risk and upside potential of the securities that have 
been identified to be optimal in each of these four cases into the expression for the firm’s cost of 
capital in Proposition 1 completes the proof of Proposition 3. | | 


with face value 


21 Also note that (A2 , 49) lies to the right of (А„, Ду) = (0,0) whenever first-best is not attainable. 
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Proof of Proposition 4 
This proof is similar in style to Proposition 3, so its proof is omitted. Nu 


Proof of Proposition 5 
We start the proof of the proposition by proving the following Lemma. 


Lemma 4: For any e > єр for which there exists a feasible security—call it "security 1”-- 
that implements the expansive disclosure policy, there exists a feasible debt 8е- 
curity that implements the limited disclosure policy that increases the manager’s 
expected profits relative to that obtained from security 1. 

Proof: Suppose e > ely and some security, say “security 1,” eliciting expansive disclosure 
exists. Then, we are in case 3 of Proposition 4. As the proof of Proposition 4 shows, 
optimal securities in this case are hybrid securities determined by those securities on 
the “supply-financing” line: 


неза (1-2-4 dp 

AU = ——————— 

7 

2749 

that Не in the constraint set for Program OSD1. Single out the optimal security on this 
line that also lies on the boundary line: 


ре-Ар-(1-р)Ау ^ Q0) 


of the constraint set. Call this latter security "security 2," and denote its components 
by (ud, А2, A2). Security 2 necessarily generates at least as high expected profits for 
the manager as does security 1, because security 2 is optimal for Program OSD1 and 
security 1 is merely feasible for this program. Recall by Lemma 1 that securities that 
lie on the line (20) can induce the manager to adopt either the limited or expansive 
disclosure policy. In particular, this is true for security 2. 
We now make two claims: security 2 is feasible for Program OSDO, and security 2 yields the 
manager higher expected profits under that program than under Program OSDI. This will prove 
that the manager is better off implementing the limited disclosure policy upon being given security 
2 rather than the expansive disclosure policy. Notice that the firm's cost of capital when the 
manager implements the expansive disclosure policy with security 2 satisfies: 


ApAp + AyAy=Ap(pe + (1 - p)Ay) + Ab Ay 
= (Ap +Ay)Ay + Apple – Ay) > (Ap + Ay) Ay + App(e — Ay) 
=A} (pe + (1 - р)Ду) + Ag Ду = ABAp АД, (21) 
where the-inequality follows from Corollary 1 and the first and last equalities follow from security 
2 lying on (20). Of course (21) is equivalent to: 
( - > Ab) p+ (2 -АЬе х ( - - АВА (2 Аль (22) 


(22), along with the feasibility of (Y? , A5, A2) for Program 0801 and the fact that (02, A2, A2) 
lies on (20), implies the feasibility of (42, Д2,, A2) for Program OSDO. (21) implies that the 
manager’s expected profits are strictly higher by implementing the limited disclosure policy than 
the expansive disclosure policy upon being given security 2. This proves the two claims. 
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To conclude the proof, recall that Proposition 3 established that debt securities always opti- 
mally implement the limited disclosure policy, so there is some debt security, say “security 3,” that 
is at least weakly preferable to security 2 that implements the limited disclosure policy. Since 
security 2 is weakly preferable to security 1, the proof of the lemma is complete. 


Proof of Proposition 5 

The proof of this proposition consists of the following three observations. First, the function 
defining the manager’s maximum expected profits when he implements the limited disclosure 
policy is linear in e over the interval є є (1-1, е1]; the same is also true of the function defining 
the manager’s maximum expected profits when he implements the expansive disclosure policy (for 
this same interval (4-1,61512 Second, the manager’s maximum expected profits are strictly 
higher under the expansive disclosure policy than under the limited disclosure policy at, and—by 
continuity—in the vicinity ође = р — 1. (This follows from the discussion surrounding Propo- 
sition 2.) Third, the manager's maximum expected profits are strictly higher under the limited 
disclosure policy than under the expansive disclosure policy at, and—by continuity—in the vicin- 
ity of e = e]. (This follows directly from Lemma 4 above.) Hence, these expected profit functions 
must cross exactly once in the interval (а – 1, е |, which defines the point е; in the statement of 
the proposition. The conclusion about the global optimality of risk-free debt, investment-grade 
debt, or speculative debt then follows directly from Propositions 3 and 4. 


Proof of Proposition 7 


Disclosure of 0, is preferred over non-disclosure of 6,, when the following condition is 
satisfied: 


9, x Чи = #10– фаз 2,4(0,) = 2], 
where RHS of фе inequality is given by: 
p – (0. у) + (1 -»(. z (2, + (1 = ул + у | 


ELO- yd = 2,4(8,) = 2]- р(1- 7) +(1–р) 


Simplification yields: 


On — Wn = EL O~ id = D,d(8,,) = ©] 


Unt Ш 
08-14. = Ө. ~ 2 
Yat h > р, 
2 
Z Because: (1) by Proposition 3, ће "в expected profit function when he adopts the limited disclosure policy is 


linear in e for e in the interval (4-1, ер]; (2) by Proposition 4, the manager's expected profit function is linear in є 
for e in the interval (и — 1, Єр), and so, since бір < Zip, a fortiori is linear over the interval (що 1, elo]; (3) el; 


aeri < ——#с—— шй $ «Ab, by Corollary 1 ; and so (4) both of th fit 
Дуже ita ет» because Ар < Ap, by Пагу 1 above; and so (4) of these expected pro: 
functions are linear in e for e in the interval (1-1, elp]. 
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Ду Р Ap. 


The case for which non-disclosure of б„ is preferred over disclosure ої 0,, is identical. 

_ Next, we derive the trader’s optimal trading strategy as a function of the manager’s disclosure 
policy. Suppose the manager discloses both the low and high state but not the intermediate state. 
Then, the trader prefers to buy the security when хрло = О if and only if her expected purchasing 
price, uP(2,0) + (1 — 9Р(®,1), is less than the expected payoff of the security, Е]. P(@,1) 
equals ЕД. (because the market maker can infer that 5„нда= 0) and P(Z,0) is a weighted 
average of Е V] and y. Hence, у(@,0) = 1 is optimal if E[ | > Ym or Ay > Ap. This condition 
is incompatible with the manager choosing not to disclose 6,,. Hence, the trader does not buy the 
security when Swshheia = О. The trader prefers to buy the security when 5,444, 1 if and only if her 
expected purchasing price, uP((2,0) + (1 — v)P(@,]1), is less than the payoff of the security, фу, 
P((2,1) equals V, (because the market maker can infer that 5,44, 1) and P((2,0) is a weighted 
average of ЕР] and Yp. Hence, y(@,1) = 1 is optimal if у, > Ё ог Ay < Ap. This condition 
is consistent with the manager choosing not to disclose 9„. Hence, the trader buys the security 
when Swine = 1. Clearly, the trader never buys the security when the manager fully discloses his 
private information. ш 
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ABSTRACT: This study jointly evaluates firm-level changes in Investor composition and 
shareholder distributions following a 2003 reduction in the dividend and capital gains 
tax rates for individuals. We find that directors and officers, but not other Individual 
investors, rebalanced their portfolios to maximize after-tax returns in light of the new tax 
rules. We also find that firms adjusted their distribution policy (specifically, dividends 
versus share repurchases) In a manner consistent with the altered tax Incentives for 
individual investors. To our knowledge, this Is the first study to employ simultaneous 
equations to estimate both shareholder and managerial responses to the 2003 rate 
reductions. We find that the generalized method of moments (GMM) estimates are 
substantially stronger than OLS estimates, consistent with our expectation that investor 
and manager responses are simultaneously determined. Failure to estimate systems of 
equations may account for some of the weak and conflicting results from prior studies of 
the 2003 rate reductions. 
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I. INTRODUCTION 
his study employs simultaneous equations to assess the responses of both investors and 
| companies to the 2003 reductions in dividend and capital gains tax rates. We predict that 
individual investors, the only ones affected by the reduction in shareholder taxes, rebal- 
anced their portfolios to maximize after-tax returns in light of the new tax rules. We also predict 
that firms adjusted their distribution policies (specifically, dividends versus share repurchases) to 
maximize share value—i.e., distributing profits in a manner that was most attractive to their 
investors after considering shareholder taxes. With regard to investor responses, we find evidence 
that insiders (ie., directors and officers) increased holdings in their own companies if their 
dividend-repurchase mix reflected the new tax incentives. However, we find no evidence that other 
individual investors rebalanced their portfolios. With regard to managerial responses, we find that 
firms with disproportionately large individual holdings modified their payouts in a manner con- 
sistent with the altered tax incentives. However, changes in dividend and repurchase policy were 
not immediate; firms deferred widespread, substantial changes until the second quarter following 
enactment. 

To our knowledge, this is the first study to jointly estimate investor and firm responses to 
changes in shareholder taxes. The fact that both investors and firms can change their behavior 
following a change in shareholder taxes presents an identification problem. To illustrate, suppose 
we test for an association between dividend yields and individual stock ownership and find that the 
correlation becomes more positive following a reduction in individual dividend tax rates. Such a 
finding is consistent with both (1) individuals switching to high-dividend-paying firms following 
the tax cut (ie., a tax clientele response) and (2) firms that are held mostly by individuals 
increasing their dividends following the tax reduction (a firm payout response). To distinguish 
between investors rebalancing their portfolios and firms altering their distributions, this study 
estimates simultaneous equations. Prior studies have focused on either investors or managers, but 
not both, and results have been weak and in conflict. By adopting simultaneous equations, we find 
evidence of substantially stronger investor and managerial responses to a decline in shareholder 
tax rates. 

To maximize the power of our tests, we compare a firm’s ownership and dividend-repurchase 
mix before and after the largest change in U.S. dividend taxation. The Jobs and Growth Tax Relief 
Reconciliation Act of 2003 (JGTRRA) reduced, for individuals only, the maximum, statutory 
dividend tax rate from 38.6 percent to 15 percent. It also lowered the maximum, statutory indi- 
vidual capital gains tax rate, which applies to share repurchases, from 20 percent to 15 percent. In 
many ways, JGTRRA is an ideal legislative setting for testing an association between shareholder 
taxes and payout policy. The scope of JGTRRA was narrow. Its genesis was individual dividend 
tax reduction. Its primary amendment was individual capital gains tax relief.' The final bill did 
little more than reduce dividend and capital gains tax rates. Nonetheless, the economic effects 
were huge.” Because of its narrow focus and big impact, the JGTRRA provides a much stronger 
setting than other shareholder tax rate changes, which were either much smaller (e.g., Tax Relief 
Act of 1997) or involved widespread overhaul of the tax system that affected far more than just 
shareholder taxes (e.g., Tax Reform Act of 1986). 

The JGTRRA dividend and capital gains tax cuts should have altered the optimal mix of 
dividends and repurchases for at least some individual investors. As a result, we expect that some 
individuals rebalanced their portfolios so that a higher proportion of their equity returns came in 


! For the legislative history, see Auerbach and Hassett (2007). 

? The size of the tax savings has been very large. In September 2007, the U.S. Congress’ Joint Committee on Taxation 
reported that the 2003 dividend and capital gains tax rates will cost the Treasury $632 billion between 2007 and 2011, 
the largest tax expenditure in the tax code. 
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the form of dividends. Consistent with such portfolio rebalancing, Desai and Dharmapala (2011) 
report that total U.S. equity investments shifted from foreign countries, whose companies did not 
qualify for the lower dividend tax rates, to foreign countries, whose companies did quality for the 
lower dividend tax rates. However, to our knowledge, no one has documented whether individual 
investors rebalanced their much larger holdings of domestic holdings in a manner consistent with 
the changed tax incentives following passage of the JGTRRA. This study addresses that void by 
studying the domestic portfolio rebalancing of three different individual investor groups. 

To attract individual investors who were looking for more dividends following passage of the 
legislation, we expect that at least some firms increased the dividend portion of their distributions. 
Consistent with such a managerial response, Chetty and Saez (2005) document that dividend 
initiations jumped in 2003. Brown et al. (2007) find that reductions in share repurchases funded 
these 2003 dividend initiations, with such substitution limited to companies where directors and 
officers held disproportionately large shares. They find no similar managerial responses among the 
set of firms that account for almost all dividend issuances——i.e., the firms paying dividends before 
JGTRRA, or among firms without high insider ownership.? Comparing executive compensation in 
2003 and 2002, Aboody and Kasznik (2008) also reach mixed conclusions about the changes in 
the dividerid-repurchase mix. 

We conduct a more comprehensive study of managerial responses to JGTRRA by looking at 
dividend-paying firms (which far exceed initiators in number and payout), studying both insiders 
and other individual investors, and extending the analysis to include changes in distribution poli- 
cies through 2005 (two years beyond the extant JGTRRA studies). Lengthening the investigation 
period enables us to calibrate how long it took investors and managers to respond to JGTRRA. We 
also estimate a system of equations and find that results are statistically and economically more 
significant under simultaneous equations than under ordinary least-squares. 

To test for investor and firm responses, we compare the percentage of shares held before and 
after JGTRRA for three groups of individual investors: insiders (і.е., directors and officers of the 
firm), other non-executive individuals investing on their own account, and mutual fund investors. 
We aggregate each firm's dividends and repurchases during the eight quarters immediately pre- 
ceding the quarter of enactment and compute the ratio of dividends to total payout (dividends plus 
герагсВавев) З We compare that ratio to the one formed with aggregated dividends and repurchases 
for the eight quarters immediately after the quarter of enactment. We then test for an association 
between the change in shareholders and change in the dividend-repurchase mix. This difference- 
in-differences approach mitigates the likelihood of spurious conclusions arising from omitted 
correlated variables. Estimating a system of equations enables us to determine whether the asso- 
ciation is driven by tax clientele effects, payout changes, or both. 

We find evidence consistent with both investor and firm responses to JGTRRA. However, 
insiders are the only investors who appear to have altered their holdings in response to JGTRRA. 
We find stronger evidence that firms modified their payout policy in response to changes in 


3 Brown et al. (2007) are careful to state that their inferences about substitution are limited to 2003 dividend initiators. 
That said, becanse their sample includes both initiators and non-initiating firms, it was unclear to us whether the findings 
reported in Table 6 of their study shed any light on the firm responses of non-initiators. Private conversations with the 
authors confirmed that their conclusions were limited to 2003 dividend initiators. In addition, to enable us to investigate 
the non-initiators in their study more closely, the authors kindly provided us with their data, for which we were most 
appreciative. We replicated their results, and, consistent with the inferences in their study and our conversations with 
them, we found that their results only hold for firms that initiated in 2003. 

Many mutual fund investors are not individuals and the earnings for many mutual funds held by individuals are not 
subject to JGTRRA, e.g., 401(k) investments. Unfortunately, we cannot observe the extent to which JGTRRA affects a 
specific mutual funds’ investor base. This limitation biases against our finding a response by mutual fund investors. 

5 We aggregate to reduce some of the noise arising because repurchases, unlike regular quarterly dividends, are uneven 

and irregular. 
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shareholder taxes. In particular, we find that the movement toward distributing a larger proportion 
of profits as dividends was greatest among not only those companies held disproportionately by 
individual investors, particularly directors and officers, but also those firms where other individu- 
als and mutual funds had large holdings. We find that firms began to substantially alter their 
distribution policies in the second quarter following the passage of JGTRRA, consistent with firms 
needing time to adjust their dividend and repurchase policies. 

We find that the economic significance of the 2003 legislation is substantially greater using 
simultaneous equations than using ordinary least-squares (OLS). Tais is an important contribution 
of this study because, to our knowledge, we are the first study to jointly evaluate shareholder and 
firm behavior. Based solely on our OLS results, we would erroneously conclude that no investors 
rebalanced their portfolios and that the firm responses were limited to those companies with heavy 
insider ownership. Using generalized method of moments (GMM) estimation of the system of 
equations, which is the appropriate approach given our expectaticn that investors and managers 
simultaneously readjusted their behavior after 2003, we find that some investors, namely individu- 
als, altered their portfolios and that firms were responsive to insiders, mutual funds, and other 
individual tax considerations. 

The next section develops testable hypotheses. Section III details the research design. Section 
IV presents the empirical findings. Closing remarks follow in Section V. 


H. HYPOTHESIS DEVELOPMENT 
Framework 


This study does not attempt to tackle the longstanding puzzle of why firms pay dividends 
when they could distribute profits tbrough share repurchases, which remain tax-advantaged, 
though less so, even after JGTRRA.Ó Rather, in a nutshell, we: (1) accept the fact that investors 
desired and some firms paid dividends before JGTRRA (obviously for non-tax reasons), (2) 
assume that the mix of dividends and repurchases was optimal before passage of the legislation, 
(3) expect that the large tax rate reductions for dividends (compared with the relatively modest 
reductions for capital gains) led some individuals to rebalance their portfolios in favor of dividend 
income, and (4) predict that some firms, in response to the changing tax incentives for individual 
investors, increased the portion of their distributions in the form of dividends after JGTRRA. The 
remainder of this section elaborates on these relations to develop formal hypotheses about the 
impact of JGTRRA on shareholder distributions. 

To develop the intuition for our hypotheses, we start with a simple framework.’ Absent taxes, 
suppose that all investors hold optimally diversified portfolios of the risk-free asset and the market 
portfolio. In that setting, shareholder ownership of stocks should not vary across investors with 
similar risk preferences. Now suppose some investors (call them individuals) become taxed on 
their dividend income. These individual investors will no longer hold the original, pre-tax, opti- 
mally diversified portfolio. Rather, they will underweight their portfolio in tax-disfavored, 
dividend-paying stocks and overweight their portfolio in tax-favored, no-dividend stocks. This 
shift will boost the price of no-dividend stocks and drive down the price of dividend-paying 


5 For a sampling of the “dividend puzzle” literature, see Miller and Modigliani (1961), Feldstein and Green (1983), 
Bagwell and Shoven (1989), Berhheim (1991), DeAngelo et al. (2006), Jagannathan et al. (2000), Fama and French 
(2001), Grullon and Michaely (2002), Dhaliwal and Li (2006), Gordon and Dietz (2008), and Chetty and Saez (2010), 
among many others. On a different note, even though both repurchases and divideads now face a maximum tax rate of 
15 percent, repurchases remain tax-advantaged for at least two reasons. First, sellers can offset the tax basis of the shares 
that they sell against the proceeds from the sale in computing their capital gains. Second, they can offset those capital 
gains with capital losses that otherwise might not be deductible. This contrasts with dividends, where the entire amount 
is taxed upon receipt. 

We appreciate the contributions of an anonymous referee in sketching out this framework. 
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stocks. These price movements will entice non-individual investors (call them tax-exempts) to 
hold more dividend stocks and less no-dividend stocks. As a result, each investor ultimately would 
hold the portfolio that features his optimal trade-off between risk and after-tax return; the share 
price of each stock would equate supply and demand; and a heterogeneous mix of shareholders 
would emerge endogenously. One example of an empirical study of such tax clientele responses is 
Dhaliwal et al. (1999), who document increases in institutional holdings when firms initiate divi- 
dends, which are tax-disadvantaged, to individual investors. З | 

Similarly, if individuals were taxed more heavily on dividends Шап on share repurchases, then 
they would overweight their portfolios with stocks that distributed disproportionate amounts of 
profits through repurchases, as compared to dividends.” Meanwhile, non-individual investors 
would overweight their portfolios with stocks that distributed disproportionate amounts of profits 
through dividends, as compared to share repurchases. In this setting, managers would choose both 
the level and mix (dividends versus share repurchases) of shareholder distributions that maximize 
the firm’s stock price. Their distribution policy would affect both the equilibrium price and the 
equilibrium mix of investors and changes in their payout policy will induce a change in the mix of 
investors that own the firm. ° 

Now consider a reduction in the dividend tax rate. Individual investors would rebalance their 
portfolios, adding more dividend-paying stocks than was optimal under the prior high-dividend tax 
regime. This will drive up the price of dividend-paying stocks, altering the optimal portfolio mix 
for tax-exempt investors who would now shift from dividend-paying stocks to no-dividend stocks 
at the margin. At the same time that investors would be adjusting their portfolios in light of the 
changes in dividend tax policy, managers would be revising their level and mix of shareholder 
distributions in light of the new tax policy to continue to maximize their stock price. As a result, 
both the investor mix (individuals versus tax-exempts) and the firm’s distribution policy (divi- 
dends versus share repurchases) would change after a reduction in the dividend tax rate. 

Determining the relative importance of the investor response and the firm’s response would 
require joint evaluation of both investors’ and managers’ incentives. Studies that examine only 
individual investors’ new tax-motivated demand for dividends might erroneously attribute all of 
the increase in dividends to a clientele effect. In the extreme, this is true even if investors did not 
rebalance their portfolio, but rather, firms simply increased their dividend payouts. Likewise, 
studies (such as several prior examinations of JGTRRA) that focused solely on firm’s new tax- 
driven supply of dividends might erroneously attribute increased dividend income by individuals 
to а payout response when the result was actually due to individual investors rebalancing their 
portfolios. In the tests below, we find evidence of both a clientele and a managerial response to 
shareholder tax rate reductions. 


Investor Responses 


We begin our hypothesis development by focusing on potential investor responses to 
JGTRRA. We assume that because the decline in the dividend tax rate (from 38.6 percent to 15 


8 Moser and Puckett (2009) include the JGTRRA in their study of whether there is a positive association between the 
partion of dividend-paying securities in tax-advantaged institutions’ portfolios and the dividend penalty. Although they 
find evidence of a positive association, their analysis is confounded by the classification of financial institutions as 
tax-advantaged institutions (Blouin 2009). 

9 We are assuming that tax-sensitive investors do not interpret any potential dividend and repurchase signals differently 
from other market participants. 

10 For a formal equilibrium model of stock ownership and stock price, see Guenther and Sansing (2010). Motivated by 
recent empirical findings of a relation between tax-exempt ownership and dividend tax capitalization, they model and 
provide empirical evidence that dividend tax capitalization reflects all investors’ weighted average tax rates, where the 
weighting varies with the investors’ risk tolerances. An important difference between their study and ours is that they 
focus on prices and ownership, whereas we focus on payouts and ownership. 
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percent) exceeded the decrease in the capital gains tax rate (from 20 percent to 15 percent), the net 
effect of JGTRRA was to make firms that distributed profits mostly through dividends more 
attractive for individual investors than those that distribute profits mostly through share repur- 
chases. Assuming investors were holding the optimal portfolio (considering risk and taxes) before 
JGTRRA’s enactment, we predict that, after enactment, individual investors altered their portfolios 
to receive a higher percentage of their returns in the form of dividends. 


H1: Individual investors responded to JGTRRA by increasing their holdings in stocks that 
distribute larger proportions of their profits through dividends. 


H1 notwithstanding, it is important to recognize that investors cannot freely rebalance their 
portfolios. Aside from the commissions and other transaction costs that investors face on all trades, 
taxable investors pay capital gains taxes on any excess of the proceeds from the sale of the stock 
over the basis of the stock. Thus, some investors may accept an inferior portfolio, rather than incur 
the tax and non-tax costs of rebalancing their portfolio. This is one reason why we might not 
observe investors engaging in widespread rebalancing following JGTRRA. 


Firm Responses 

We anticipate that firms will respond to individual investors’ enhanced interest in dividends 
by increasing the portion of profits that they distribute in the form of dividends. Those firms 
wishing to retain or increase their holdings by individual investors are likely to distribute more of 
their profits as dividends, following the enactment of JGTRRA, Шап other firms. Unfortunately, 
we cannot observe the pool of future shareholders that firms hope to attract by adjusting their 
payout policy. Therefore, we look to their shareholder mix at passage and assume that firms with 
greater individual ownership at enactment would be more likely to alter their distribution policy to 
distribute more of their profits as dividends than those firms with less individual ownership. To the 
extent a firm's current investor mix is not a good predictor of managers' desired investor mix in 
the future, our tests are biased against finding a firm response to JGTRRA. 


H2: Managers responded to JGTRRA by distributing a larger portion of their profits as 
dividends. The extent to which managers increased their dividend percentage increased 
with the individual ownership of their firm. 


The tension in H2 arises from several reasons why managers might not have adjusted their 
dividend-repurchase mix in response to JGTRRA. First, the JGTRRA tax rate reductions were 
scheduled to expire in five years at the end of 2008 (later deferred to the end of 2010), and 2004 
Democratic Presidential candidate, John Kerry, pledged to restore the higher dividend tax rates for 
the two highest tax brackets, if elected. Since dividends tend to be sticky and the markets histori- 
cally punish firms for decreasing dividends, many firms may have chosen to leave their distribu- 
tion policy unchanged, delay any change until after the 2004 elections, or turn to one-time special 
dividends (which we ignore in this study). Second, dividends are purported to play an important 
role by alleviating asymmetric information through conveying private information to the market. 
Tax-motivated dividend changes might undermine this signal. Third, large increases in dividends 
could adversely affect the firm's compensation structure, particularly to the extent that the firm 
relies on stock options, which are not dividend-protected. Consistent with this deterrent to modi- 
fying distributions, Aboody and Kasznik (2008) find that firms that increased their dividends after 
JGTRRA modified their stock option and restricted stock compensation plans. In short, there were 
multiple reasons for firms to be hesitant about modifying their distribution policy following 
JGTRRA, even if individuals were seeking higher dividends than before the legislation. 
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Ш, RESEARCH DESIGN 

System of Regression Equations 

As discussed above, we jointly evaluate the changes in investor composition and distribution 
policy following the enactment of JGTRRA. We predict that individual investors rebalanced their 
portfolios so that a larger proportion of their shareholder income was in the form of dividends as 
opposed to share repurchases. We also expect that managers altered their distributions so that a 
larger proportion of their distributions were dividends and that this adjustment was increasing in 
the extent of individual ownership. 

To jointly evaluate the impact of investor and manager responses to JGTRRA, we estimate a 
system of four equations (variables are defined below): 


INSIDER = ад + орж POST + аг * DIV96 + оз * DIV% ж POST + ад + S&PRATING 


+ a5 + AGE + ав ж SP500 + o + LIQUIDITY + ав + BETA + œ + IRISK 
+ ао + МКТАРЈКЕТ + ауу * SALESGR + ол + R&DINT; (1) 


МОМЕХЕС = бо + 6) * POST + 6, ж DIV% + бу * DIV * POST + бүж S&PRATING 
+ 6; ж AGE + б * SP500 + 8; ж LIQUIDITY + 6, + ВЕТА + ду ж IRISK 
+ 610 * MKTADJRET + бүү * SALESGR + 912 ж R&DINT; (2) 


MF = y+ y, * POST + y, ж DIV% + уз ж DIV% ж POST + уд + S&PRATING + ys + AGE 
+ ув + SP500 + уд + LIQUIDITY + уд ж BETA + ys + IRISK + у ж MKTADJRET 
+ уп * SALESGR + yi + R&DINT; (8) 


DIV% = Ву + Ві + POST + В, * INSIDER + В, + INSIDER * POST + p4 * МОМЕХЕС 
+ В; + МОМЕХЕС ж POST + Вс ж МЕ + В, ж МЕ ж POST + By ж КЕ + By ж PERM 
+ Bio * TRANS + Ві; ж FCF + Ву; * DYIELD + Вуз + LEVERAGE + Від * SIZE 
+ Industry Indicators. (4) 


To capture investor responses, the first three equations regress the percentage of the firm held by 
three individual investor groups on the percentage of the firm’s payouts that are dividends (DIV%). 
There is one equation for each investor group: insiders (INSIDER in Equation (1)), other individu- 
als (МОМЕХЕС in Equation (2)), and mutual funds (МЕ in Equation (3)). The coefficients on the 
key variables of interest are аз, 63, and уз, respectively. The fourth equation in the system flips the 
direction of the association and regresses the percentage of the firm’s payouts that are dividends on 
each of the three investor groups. The fourth equation has three key variables of interest, one for 
each investor group in the period post-enactment: insiders (85), other individuals (85), and mutual 
funds (£). 


Dividends-to-Payout Ratio Variable 

The percentage of ће firm's payouts that are dividends (DIV%) is an explanatory variable in 
the first three equations and the dependent variable in the final equation. DIV% is a ratio where the 
numerator is the sum of dividends over an eight-quarter period and the denominator is the sum of 
dividends and share repurchases over the same eight quarters. There are two eight-quarter periods 
for each firm. The first eight quarters are the eight quarters immediately before the fiscal quarter 
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in which JGTRRA was enacted." The second eight quarters are the eight quarters immediately 
following the fiscal quarter in which the JGTRRA was enacted." Consequently, each firm has two 
DIV% measures: one before enactment and one after enactment. 

We aggregate distributions over a two-year period because, unlike regular, quarterly divi- 
dends, share repurchases are irregular. Therefore, focusing on the dividend-repurchase mix in a 
single quarter could introduce excessive noise. That said, in sensitivity tests reported later, we 
relax this aggregation requirement and report results on a quarterly basis. Inferences are largely 
unaltered. 

We can measure repurchases in two ways. One option is total share repurchases. Another 
option is net repurchases—i.e., total share repurchases less stock issuances.!? We use net repur- 
chases because we are interested in the cash that the firm could have distributed as dividends. 
Fama and French (2001) note that dividends cannot substitute for repurchases in many situations. 
Firms need shares for executive compensation, stock option exercises, stock acquisitions, and 
funding employee stock ownership plans, among other things. Thus, consistent with Fama and 
French (2001), we measure net repurchases as the change in treasury stock.!* If there is а net 
decrease in treasury stock, then we truncate our measure of repurchases at zero. For those firms 
that do not use the treasury stock method, we measure net repurchases as total repurchases from 
the statement of cash flows less decreases in preferred stock. 

For post-enactment observations in the first three regressions, we interact DIV96 with POST, 
a categorical variable that equals 1 for observations after the May 23, 2003 enactment (i.e., 
quarters after the second quarter of 2003). In these three tax clientele tests, positive coefficients on 
DIV% ж POST (the coefficients аге оз, &, and уз in the system of equations) can be interpreted 
as evidence that, after passage of the JGTRRA, individuals rebalanced their portfolios by shifting 
toward stocks where dividends constituted a larger portion of total payouts. We also include POST 
as a separate variable in each regression equation to capture any other temporal change. 


Investor Group Variables 

We employ three variables to capture the portion of the firm owned by individual investors. 
The first individual ownership measure, INSIDER, is the percentage of shares held by directors 
and officers, as reported in Thomson Financials Insider Filing Гага. Note that these shareholders 
play dual roles—as the managers setting distribution policy and as individual shareholders, often 





И We exclude the enactment quarter (May 2003) because it is unclear which tax regime managers were contemplating 
when they issued dividends and repurchased shares during that quarter. We treat the first quarter of 2003 as a pre- 
enactment quarter, even though the legislation was retroactive to the beginning of the year. The reason is that passage 
of the legislation was uncertain until Vice-President Cheney's tiebreaking vote in the U.S. Senate in May. Sensitivity 
tests, detailed below, provide assurance that this classification is appropriate. In addition, inferences hold if we exclude 
any 2003 quarters from the pre-enactment period. 

Blouin and Krull (2009) show that share repurchases rose in 2005 as firms enjoved a tax holiday for repatriating 

earnings from foreign subsidiaries. When we replicate our analysis excluding the 2005 quarters, inferences hold. 

13 For further detail, see the discussion in footnotes 5 and 6 of Boudoukh et al. (2007) and footnote 9 of Skinner (2008). 

М Using treasury stock to measure repurchases is not without limitations. As Fama and French (2001) point out, using 
annual changes in treasury stock will fail to match a repurchase in one year and its reissuance in another year. This 
problem is mitigated in our research design because we combine two years of activity into one observation. However, 
even aggregation over two years cannot fully eliminate the potential mismeasurement. 

15 The reporting of holdings of insiders is mandated by Section 16 of the Securities Exchange Act of 1934, which applies 
to every person who is the beneficial owner of more than 10 percent of any clase of equity security registered under 
Section 12 of the Exchange Act and each director and officer (collectively, “repcrting persons" or "insiders") of the 
issuer of the security. On a different ncte, conclusions do not change if we limit INSIDER to direct holdings, excluding 
shares held by family members, trusts and corporations controlled by the insider, and similar related parties. 
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with large stockholdings and suffering from inadequate diversification. 

The second measure of individual ownership is NONEXEC, which is intended to measure all 
individual holdings, other than those by insiders or through mutual funds. Ideally, we would 
measure the number of shares for which dividends and capital gains are expected to flow through 
to individual tax returns—i.e., those shares held by individuals or flow-through entities (e.g., 
mutual funds, partnerships, trusts, S corporations, or limited liability corporations) whose income 
is reported on U.S. individual tax returns. This ideal measure would exclude those holdings for 
which the dividends and capital gains do not flow through to individual tax returns, such as 
tax-exempt organizations, corporations, foreigners, and tax-deferred accounts (e.g., qualified re- 
tirement plans, including pensions, 401(k), and IRAs). Unfortunately, the ideal measure does not 
exist. Thus, as in Ayers et al. (2002). Ayers et al. (2004), Blouin et al. (2003), and Dhaliwal et al. 
(2003), among others, we use 13-F filings to estimate the percentage of the firm held by individual 
shareholders. NONEXEC is 1 less (1) the percentage of shares that institutional investors own, as 
reported in 13-F filings and collected by Thomson Financial’s Institutional Holdings database, (2) 
the percentage of shares held by non-officer/director beneficial owners as reported in Thomson 
Financial's Insider Filing data, and (3) INSIDER. 

The third measure of individual ownership, MF, is the percentage of the firm owned by 
mutual funds, as reported in 13-F filings and collected by Thomson Financial's Institutional 
Holdings database. As mentioned above, this is an imperfect measure of individual ownership 
because mutual funds include both investments that are subject to personal taxes and investments 
that are not subject to personal taxes. Sometimes the dividends and capital gains realized by 
mutual funds are taxed at the individual level. At other times, distributions to mutual funds are 
exempt because the shares are held in deferred tax accounts, such as 401(k) or IRAs. We include 
MF as an attempt to capture all shareholder income that is taxed on personal tax returns. However, 
we recognize that the unobservable measurement error in MF (arising from the inclusion of 
non-individual owners) may lessen its usefulness. 

Each of the three individual ownership measures serves as the dependent variable in one of 
the first three regressions. All three are explanatory variables in the fourth regression, which tests 
for firm responses. Positive coefficients on these variables are consistent with firms altering their 
payouts in response to changes in individual tax incentives. Specifically, a positive coefficient on 
INSIDER ж POST (Аз) in the fourth regression will be interpreted as evidence that, after passage 
of the JGTRRA, the percentage of payouts distributed as dividends was increasing in insider 
ownership. It seems likely that, when directors and officers hold large shares of a firm, payouts are 
likely to be particularly responsive to individual tax incentives. À positive coefficient on NON- 
EXEC * POST (s) in the fourth regression will be interpreted as evidence that, after passage, 
DIV% was increasing in the percentage of the firm held by non-executive shareholders. A positive 
coefficient on MF ж POST (8:) in the fourth regression will be interpreted as evidence that, after 
passage, DIV% was increasing in the percentage of the firm held by mutual funds. The measure- 
ment error in MF (arising from the fact that non-individuals invest in mutual funds) should bias 
the coefficient on MF * POST toward zero. 


Control Variables 


Theory is insufficiently rich to provide much guidance concerning the control variables in a 
system of equations where the dependent variables are investor composition and the mix of 


16 Brown et al. (2007) report that insiders were particularly influential among dividend initiators in 2003. They present 
evidence consistent with dividends crowding out repurchases in firms with large insider holdings. However, they find no 
such substitution or insider influence among companies that were paying dividends before JGTRRA, which is the group 
of firms that is the focus of our analysis. 
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dividends and repurchases. To our knowledge, no study to date models the non-tax variables that 
should vary with the dependent variables in this study. Thus, we control for a host of factors that 
have been found to be associated with the investor mix and the distribution mix. 

For the three regression equations testing for clientele effects, we rely on Bushee (2001), who 
shows that the level of institutional ownership is associated with firm value, and Del Guercio 
(1996), who documents that institutional holders tend to hold investments that are more prudent. 
Hence, we include a number of control variables in the investor holdings regressions to capture 
firm value and the relative quality of the investment. Specifically, we include SALESGR as a proxy 
for firm growth, defined as the average sales growth over the three previous years. We include two 
proxies for firm risk: beta (BETA), which is included to control for systematic risk, and the 
standard deviation of the prior year’s daily market model residuals (IRISK) to control for idiosyn- 
cratic risk. Market-adjusted returns over the prior year (MKTADJRET) is intended to control for 
firm performance, which has been found to be positively associated with institutional holdings. 
The S&P common stock rating (S&PRATING) and the number of years that the firm is covered by 
CRSP (AGE) are included to capture the relative quality of the underlying investment. The log of 
the prior year's average monthly volume divided by sbares outstanding (LIQUIDITY) is included 
as a control for liquidity because institutional holders prefer more liquid securities. We also 
include whether a firm is listed on the S&P 500 (5Р500) as a control because many index funds 
are required to hold these firms. Finally, we include R&D intensity (R&DINT), measured as 
research and development expenses divided by sales because Hessel and Norman (1992) report 
that some institutions are fixated on the R&D activity of the firm. 

Concerning the fourth regression, where the dependent variable is DIV%, we include various 
measures that are known to affect either dividends or repurchases, although sensitivity tests show 
that results are largely robust to the set of control variables. First, we include lagged retained 
earnings scaled by lagged total assets (RE). A firm must have earnings and profits (as defined in 
the tax law) for its distributions to be taxed as dividends. Unfortunately, earnings and profits are 
unobservable, found only in confidential corporate tax returns. 'Thus, we use retained earnings as 
a proxy for earnings and profits. If firms with low or no retained earnings have fewer distributions 
that qualify as dividends, then DIV% should increase in RE. Consistent with this expectation and 
liquidity constraints, DeAngelo et al. (2006) report that firms with low or no retained earnings pay 
fewer dividends. Next, we include earnings in the model. Jagannathan et al. (2000) and Guay and 
Harford (2000) report that dividends are paid from permanent earnings, whereas repurchases are 
paid from transitory earnings. Dittmar and Dittmar (2004) contend that both are paid from per- 
manent earnings, but agree that repurchases come from transitory earnings. Thus, we dichotomize 
earnings into a permanent component (PERM) and a transitory component (TRANS). We measure 
PERM with operating income and TRANS as the difference between net income and operating 
income, scaling both components by lagged assets.” Based on the conflicts in prior research, we 
make no prediction about the sign of PERM. However, we expect DIV% to decrease in TRANS. 
We also include a measure of the firm's payout capacity, free cash flow scaled by lagged assets 
(FCF). Dividends serve as a possible mechanism to reduce agency problems in firms with free 
cash flow (Jensen and Meckling 1976). Thus, we anticipate that DIV% is increasing in FCF. We 
include the lagged ratio of dividends to the market value of equity (DYIELD) to control for a 
firm's capacity to increase dividends. The higher the existing dividend yield, the less the firm will 
be able to increase its dividend in response to the passage of the JGTRRA. We expect DIV96 to be 
increasing in DYIELD. We add lagged long-term debt, scaled by lagged assets (LEVERAGE), to 


U Differences between permanent and transitory earnings include special items, other income, and discontinued 
operations. 
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control for cross-firm variation in capital structure. Finally, we include the natural logarithm of 
total assets (SIZE) to control for any size effects. We have no expectations about the sign of the 
LEVERAGE and SIZE coefficients. 


IV. RESULTS 

Sample Selection 

We begin our tests for investor and manager responses following JGTRRA by drawing an 
initial sample from the 14,122 firms in the Compustat database between the second quarter of 
2001 and the quarter preceding the one that includes May 2003. We then exclude (1) firms whose 
shares were not common or publicly traded, (2) firms that changed their fiscal year-end during our 
sample period, (3) financial institutions and insurance companies because regulatory constraints 
may inhibit management from altering the firm’s payout policy, (4) firms with missing Compustat 
information, and (5) firms not in existence at any time from July 1, 2001, to June 30, 2005. 

From the remaining 1,923 firms, we draw two balanced panels. Each firm must have one 
observation for the eight quarters preceding JGTRRA and one observation for the eight quarters 
following JGTRRA. Becanse DIV% is undefined if there is neither a dividend nor a repurchase, 
each firm must have at least one dividend or one repurchase, both before and after JGTRRA. The 
first sample (Dividend Payers Sample) is the 421 firms in the study that paid dividends sometime 
during the eight quarters immediately preceding JGTRRA and had either a dividend or repurchase 
(or both) during the eight quarters immediately following JGTRRA. The second sample (Dividend 
Payers and Repurchasers) is the 294 firms that both paid dividends and repurchased shares some- 
time during the eight quarters immediately preceding JGTRRA and had either a dividend or a 
repurchase (or both) during the eight quarters immediately following JGTRRA. 

Our definition of DIV% forces us to exclude firms that distribute no profits to shareholders. 
We further limit our tests to firms that paid dividends at least once during the eight quarters 
preceding JGTRRA. The reasons for these limitations are twofold. First, firms that were paying 
dividends before passage paid 97 percent of the dividends issued in the four quarters following 
enactment. However, much of the JGTRRA research (e.g., Chetty and Saez 2005; Brown et al. 
2007) has focused on the relatively narrow impact of JGTRRA on dividend initiation in 2003 
alone.'® Thus, we focus on dividend-paying firms as an under-studied, but more economically 
significant, portion of the economy. Second, dividend initiation and, to a lesser extent, a firm’s first 
share repurchase convey more and different information to the markets than simply altering the 
amount of an ongoing stream of dividends or repurchases. Thus, firms that have a history of 
shareholder distributions can likely modify the dividend-repurchase mix at a lower cost than firms 
that have never paid dividends or repurchased shares. In fact, it is possible that some managers 
that had never paid dividends before JGTRRA considered initiating payouts in response to the 
changed tax incentives associated with JGTRRA, but decided that the costs of initiation exceeded 
the benefits of attracting individual investors who were now seeking more dividend income. By 
limiting our analysis to dividend-paying firms, we ensure that the potential costs of dividend 
initiation do not affect our estimates of managerial responsiveness to the changed individual tax 
incentives under JGTRRA. This both increases the power of our tests and removes an additional 
factor for which we would need to control if we included non-dividend-paying firms. One down- 
side to limiting the sample to firms that were already paying dividends is that, if these firms were 
at their dividend capacity when JGTRRA was enacted, then they may have been unable to increase 


18 This is not to imply that initiations arising from the JGTRRA were unimportant. Initiators began issuing dividends that 
they might not have paid and likely will continue to issue dividends in the future because dividend payments are sticky 
and the market takes a dim view of dividend cuts. 
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their dividend payouts, even if they had wished to respond to the changed tax incentives for 
individual investors. | 

Another distinguishing factor about our analysis of ЈОТЕКА is that we examine investor and 
firm responses through 2005. Desai and Dharmapala (2011) study portfolio rebalancing in 2003 
only. Brown et al. (2007) and Aboody and Kasznik (2008) study changes in the dividend- 
repurchase mix for only two quarters after passage of the legislation. As Shevlin (2008) notes, 
examining such a short period requires quick response by firms and raises concerns about whether 
these studies miss important responses in 2004 and 2005. It seems unlikely that investors com- 
pletely unwound their positions and firms had fully adjusted their notoriously sticky dividends and 
repurchase policy in response to the tax rate reductions during the seven months following enact- 
ment. By investigating a longer window, we permit a more deliberate response and can calibrate 
how long it took for the largest dividend tax rate reduction in history to fully permeate the 
economy. 

Although we exclude non-dividend-paying firms from our primary tests, in the process of 
selecting the sample firms, we detect some initial evidence consistent with firms shifting from 
repurchases to dividends. Among the 1,923 firms from which we draw our samples, we find that 
145 companies initiated dividends after JGTRRA, while only 30 firms omitted dividends (a net 
increase of 115 dividend issuers). Meanwhile, 222 firms began repurchasing after enactment while 
370 companies stopped repurchasing (a net reduction of 148 repurchasers). We also find that 26 
firms both initiated dividends and ceased repurchasing after passage of the JGTRRA, while only 
three firms omitted dividends and began repurchasing. Furthermore, among 702 firms that repur- 
chased both before and after JGTRRA, 90 initiated dividends while only 10 firms omitted divi- 
dends, a net increase of 80 dividend issuers. On the other hand, among the 408 firms that paid 
dividends both before and after ЈОТЕКА, 46 began repurchasing after passage, but 58 stopped 
buying back shares, a net decrease of 12 repurchasers. All of these comparisons are consistent with 
firms shifting from repurchases to dividends following enactment of the JGTRRA. 


Descriptive Statistics 

Table 1 provides means and medians for the regression variables, both before and after 
enactment of JGTRRA, for both samples and reports whether the pre- and post-enactment means 
and medians are significantly different. The samples provide some evidence that dividends 
increased following JGTRRA (median total dividends and mean dividends per assets increased in 
both samples). However, there is less evidence that DIV% (і.е., dividends as a percentage of total 
shareholder distributions) increased after passage (only the difference in medians for the Dividend 
Payers and Repurchasers sample is significant). Recall, however, that we do not hypothesize about 
the overall impact of JGTRRA on dividends, repurchases, ог DIV%. Rather, we predict a more 
positive association between DIV% and individual stock ownership following passage of 
JGTRRA. 

Both samples show decreases in non-executive individual holdings (NONEXEC) and in- 


ІЗ Means are tested using а t-test of the means, The p-values for the medians are the larger p-values using the Wilcoxon 
and Kruskal-Willis tests. 

2) Special dividends are included in DIV%. Although the frequency of special dividends increased after the enactment of 
the JGTRRA, the relative magnitude of special dividends continued to be small as compared to regular dividends and 
repurchases. For our sample of 421 dividend payers and 294 dividend payers and repurchasers, special dividends 
constituted only 1 percent (0.4 percent) of aggregate payout made after (before} enactment of the JGTRRA. 
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creases in mutual funds holdings (МР) Among the control variables, free cash flow (FCF), 
volume (LIQUIDITY), idiosyncratic risk (IRISK), and returns (MKTADJRET) fell after enactment 
for both samples. Stock ratings (S&PRATING), beta (BETA), and sales growth (SALESGR) in- 
creased after passage for both samples. 


Preliminary Regression Results 

To provide some perspective for the simultaneous equation results that we report shortly, 
Tables 2 and 3 show separate OLS regression results for each of the four equations. Table 2 shows 
selected coefficient estimates from the first three equations, where investor ownership percentages 
are the dependent variables and DIV% * POST is the variable of interest. Table 3 presents results 


TABLE 2 
OLS 
Changes in Ownership by Insiders, Non-Executives, and Mutual Funds around JGTRRA 
Enactment 
Equation (2: NONEXEC Equation (3): MF 
Equation (1): INSIDER (Non-Executive 96 (Mutual Fund % 
(Insider % Ownership) Ownership) Ownership) 
Dividend Dividend Dividend 


Dividend Payers and Dividend Payers and Dividend Payers and 
Pred. Payers  Repurchasers Payers —Repurchasers Payers | Repurchasers 


Intercept —0.027 —0.040 0.273** 0.300** 0.414** 0.422** 
POST —0.013 —0.006 0.049 0.022 -0.020 -0.020 
DIV% —0.006 0.021 —0.008 -0.039 0.022, 0.003 
DIV% * POST (+) 0.048 0.038 —0.004 0.003 —0.026 —0.011 
S&PRATING -0.029 —0.018 —0.144 —0.020 0.002* 0.184* 
AGE —0.084** —0.063 0.064 0.072 -0.037 —0.076* 
SP500 -0.001 —0.011 0.030 0.023 —0.093** —0,082** 
LIQUIDITY —0.045** — —0.044** — —0.107** —0.108%*% 0.094** 0.089** 
BETA —0.011 —0.003 —0.082** —0.068** 0.030** 0.028 ** 
IRISK 2.211** 2.543** 5.831** 2.968** -3785"Ж 2 783ЖЖ 
МКТАРЈКЕТ 11.088 15.2815 14.383 12.398 — 7.804. – 9,263 
SALESGR 0.015 0.019 0.004 -0.019 -0.016 -0.004 
R&DINT 0.133 0.191 – 0.173 —0.423* —0.291**  —0255* 
n 842 588 842 588 842 588 





*, ** Indicate that the coefficients are significantly different from 0 at the 5 percent and 1 levels, respectively, using a 
one-tailed test where there are predictions; otherwise, using a two-tailed test. 

This table reports estimated ordinary least squares regression statistics for the 421 (294) firms that paid dividends (both 
dividends and repurchases) in the period surrounding the JGTRRA. The dependent variables are the percentages of 
ownerships by insiders, non-executives, and mutual funds. Observations aggregate eight quarters before and after enact- 
ment of the JGTRRA. POST is a categorical variable that equals 1 if the dividend/repurchase was declared in the eight 
quarters after 5/23/03, 0 otherwise. All other variables are defined in Table 1. 





21 According to the Investment Company Institute (see http:/Avww.ICLorg}, from 2001 to 2005 mutual fund ownership of 
all publicly traded equity securities increased from 21 percent to 25 percent. In addition, over our sample period, 
individuals increased the proportion of their financial assets held by mutual funds from 40 percent in 2001 to 47 percent 
in 2005. 
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TABLE 3 
OLS 
Changes in Dividends and Repurchases around JGTRRA Enactment 
Equation (4): DIV® 
Dividend Payers Dividend Payers 
Pred. (п = 842) and Repurchasers (п = 588) 

Intercept 0.95** 0.94** 
POST -0.03 -0.03 
INSIDER ~0.01 —0.02 
INSIDER * POST (+) . 0,28* 0.31* 
NONEXEC 0.02 —0.04 
МОМЕХЕС * POST (+) 0.02 0.06 
MF 0.03 —0.15 
МЕ * POST (+) 0.04 0.09 
ВЕ 0.11** 0.10** 
PERM —0.42** —0.28** 
TRANS 0.14 0.25 
FCF —0.16** —0.16** 
DYIELD 0.11 0.14 
LEVERAGE 0.08 0.01 
SIZE —0.02** —0.02* 
Industry Indicators Yes Yes 


Ж, ** Indicate that the coefficients are significantly different from 0 at the 5 percent and 1 percent levels, respectively, 
using a one-tailed test where there are predictions; otherwise, using a two-tailed test. 


This table reports estimated ordinary least squares regression statistics for the 421 (294) firms that paid dividends (both 
dividends and repurchases) in the period surrounding the JGTRRA. The dependent variable is the percentage of share- 
holder payouts that are dividends. Observations aggregate eight quarters before and after enactment of the JGTRRA. All 
variables are defined in Tables 1 and 2. 


for the fourth equation, where DIV% is the dependent variable and the ownership percentages of 
the three-investor groups are explanatory variables. 

Starting with Table 2, we expect a positive coefficient on DIV% ж POST, which will be 
interpreted as evidence that, following enactment, individual ownership increased for those firms 
that distributed larger portions of their profits as dividends. Using OLS, we find a positive coef- 
ficient on DIV% * POST (аз) when INSIDER is the dependent variable. However, the coefficient 
is not significantly greater than 0 at the 5 percent level (although using a one-tailed test, the 
coefficient is significant at the 10 percent level for the Dividend Payers sample). Contrary to 
expectation, three of the four coefficients on DIV% * POST are negative when the dependent 
variable is NONEXEC (65) ог MF (уз), although none are significantly different from 0. In short, 
the OLS results in Table 2 provide no evidence that investors rebalanced their portfolios following 
enactment, shifting toward stocks that distributed a larger portion of their payout as dividends. 

Among the control variables, two are significant in every regression. LIQUIDITY is negative 
when INSIDER or NONEXEC is the dependent variable and positive when MF is the dependent 
variable. The signs flip for ZRISK, as its coefficient is always positive when INSIDER or NON- 
EXEC is the dependent variable and always negative when MF is the dependent variable. These 
results are consistent with differences in the non-tax factors that matter to insiders and non- 
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executive individuals investing on their own account as compared to those that matter to mutual 
fund investors (see Bushee 2001). 

Table 3 presents OLS summary statistics from regressing the ratio of dividends to total 
payouts (DIV%) on the measures of individual ownership and control variables. We predict posi- 
tive coefficients on INSIDER ж POST (Вз), МОМЕХЕС + POST (Bs), and MF ж POST (f), 
consistent with firms altering their distribution policy to retain and attract individual investors, 
given the tax changes in JGTRRA. 

We find that INSIDER * POST coefficients are positive and significantly greater than 0 at the 
5 percent level. We interpret these findings as evidence that firms with large holdings by directors 
and officers distributed a larger portion of their profits as dividencs, after enactment, than they did 
before enactment. This is consistent with the individuals who set the dividend/repurchase policy 
(1.е., the directors and officers) modifying the distribution policy after enactment in a manner that 
is consistent with their own (and other individual) shareholders’ interests. The NONEXEC * POST 
and MF * POST coefficients also are positive, but not significantly greater than 0 at conventional 
levels.” These initial results are consistent with managers responding to the altered tax incentives 
of directors and officers, but not other individual investors. 

Among control variables, the coefficients on RE (retained earnings) are always significantly 
positive. The coefficients on PERM (operating income), FCF (free cash flow), and SIZE (total 
assets) are always significantly negative. 


Primary Regression Results 

We now discuss the results from estimating all four regression equations simultaneously. We 
find that coefficient estimates are larger and more significant using simultaneous estimation pro- 
cedures, consistent with joint evaluation of investor and firm responses leading to superior esti- 
mates than were detected using separate estimations. We infer from the stronger results that 
investors and managers responded concurrently to the tax changes in JGTRRA. 

Table 4 shows the results from estimating all four equations simultaneously using GMM. For 
brevity, we report only the coefficients for the key variables. Looking first at insider responses, we 
find that, when JNSIDER is the dependent variable, the coefficient on DIV% * POST (a3) is 
significantly greater than 0 for the Dividend Payers sample at the 5 percent level. (It is significant 
at the 10 percent level using the Dividend Payers and Repurchasers sample.) This finding is 
consistent with directors and officers boosting their holdings in their own firms, after enactment, if 
their firms were distributing relatively large portions of their prefits as dividends (i.e., it is con- 
sistent with a tax clientele effect)? The DIV% = POST coefficient of 0.11 implies that an increase 
of one standard deviation in DIV% ж POST, or 0.28, would boost insider holdings by three 
percentage points. This is a substantial increase in holdings by directors and officers because they 
held only 10 percent of the firm, on average, before JGTRRA. In contrast, as in Table 2, we 
continue to find no evidence that non-executive individuals (53) or mutual fund investors (уз) 
rebalanced their holdings by shifting toward firms distributing a higher portion of their profits as 
dividends, as compared with repurchases. Thus, we conclude tha: directors and officers were the 
only investor group changing their holdings in response to firm distribution policy. 





22 The МОМЕХЕС * POST coefficient is significant at ће 10 percent level using a one-tailed test in the in the Dividend 
Payers and Repurchasers sample. 

23 Since the percentage of stock held by insiders was declining during the investigation period (see Table 1), insiders in 
high-dividend-paying firms may not have been buying far more shares in their own companies. Instead, perhaps these 
insiders simply sold few shares in their own companies during this period while insiders in lower dividend-paying firms 
were unloading large holdings. 
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TABLE 4 


GMM Estimation of System of Equations 
Changes in Ownership by Insiders, Non-Execntives, and Mutual Funds and Changes in 
Dividends and Repurchases around JGTRRA Enactment 


Dividend Payers and 


Dependent Explanatory Dividend Payers Repurchasers 

Equation Variable Variable Pred. (п = 842) (п = 588) 
0) INSIDER DIV® * POST (+) 0.11* 0.10 

(2) МОМЕХЕС DIV® « POST (+) 0.09 —0.04 

(3) МЕ DIV% * POST (+) —0.02 0.01 

(4) DIV% INSIDER * POST (+) 0.32** 0.42** 
(4) DIV% NONEXEC * POST (+) 0.19* 0.18* 
(4) DIV® МЕ « POST (+) 0.28* 0.33* 


*, *+ Indicate that the coefficients are significantly different from 0 at the 5 percent and 1 percent levels, respectively, 
using a one-tailed test. 

This table reports estimated GMM estimates of the simultaneous equations’ regression statistics for the 421 (294) firms that 
paid dividends (both dividends and repurchases) in the period surrounding the ЈОТЕКА. The dependent variables are the 
percentages of ownerships by insiders, individuals, and mutual funds and the percentage of shareholder payouts that are 
dividends. Observations aggregate eight quarters before and after enactment of the JGTRRA. All variables are defined in 
Tables 1 and 2. Note that all control variables in Tables 2 and 3 are included in the estimation of the model but excluded 
from the table for brevity. 


Looking at manager responses, we find strong evidence that firms held disproportionately by 
insiders, non-executive individuals, and mutual funds began to distribute a higher percentage of 
profits through dividends following enactment. When the dependent variable is DIV%, the coef- 
ficients on INSIDER * POST (Вз), МОМЕХЕС ж POST (Bs) and МЕ * POST (;) are all positive 
and significant at conventional levels for both samples. These results are consistent with managers 
modifying their payout policy in response to their individual owners’ increased preference for 
dividends compared with repurchases, following passage of JGTRRA. In other words, as divi- 
dends became less tax-disadvantaged, firms appear to have provided more dividends as a percent- 
age of their total payout. Brown et al. (2007) report similar findings but only for insiders with 
large holdings in firms that initiated dividends in 2003.2* Our findings show that firm responses 
extended to a broader set of firms (non-initiating dividend payers) and individual investors. 

Analyzing the economic significance of each coefficient using a one-standard deviation in- 
crease, we find that the INSIDER * POST coefficient of 0.32 (0.42) with the Dividend Payers 
(Dividend Payers and Repurchasers) sample implies a four (six) percentage point increase in 
DIV%—..e., the percentage of payout distributed as dividends. With a pre-JGTRRA mean DIV% 
of 78 percent (68 percent), this suggests that, for every standard deviation increase in insider 
holdings, the ratio of dividend issuance to total distributions rose 5 percent (9 percent). Likewise, 
the NONEXEC * POST coefficient suggests a four percentage point increase in D/V% for both 
samples. Finally, the MF * POST coefficient of 0.28 (0.33) implies a four (five) percentage point 
increase in DIV% for the Dividend Payers (Dividend Payers and Repurchasers) sample. In short, 
all six coefficients across the two samples imply an economically significant change in distribution 
policy in response to altered investor preferences following JGTRRA. 


24 Brown et al. (2007) treat all 2003 dividend initiations as responses to JGTRRA, even though the legislation was highly 
controversial and passed by a single vote in the U.S. Senate in May. 
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We find that the GMM estimates are substantially stronger than those for OLS. This is 
consistent with our expectation that investor and manager responses are simultaneously deter- 
mined. Evaluating only investors or only managers potentially explains some of the weak and 
conflicting results from prior JGTRRA studies (see reviews by Dharmapala [2009] and Shackel- 
ford [2009]). If we were to stop with the OLS results, then we would erroneously conclude that по 
investors rebalanced their portfolios and that the firm responses were limited to those companies 
with heavy insider ownership. We also would substantially understate the economic significance of 
legislation as estimated by using the regression coefficients. 


Comparisons with Non-Event Periods 


This section repeats the analyses detailed above using non-event periods. If the 2003 findings 
are related to JGTRRA, then we expect the coefficients from the non-event years to be different 
from the 2003 coefficients. We repeat the tests as if the actual May 2003 rate reductions had 
occurred in May 1994, and include the eight pseudo “pre-enactment” quarters prior to the quarter 
that includes May 1994 and the eight pseudo “post-enactment’” quarters that follow the quarter that 
шиш May 1994. We then repeat this seven more times, using May of each year as the pseudo 
event. 

Table 5 reports the GMM summary statistics for the key coefficients for each year for the 
Dividend Payers sample.” For example, estimating Equation (1) for 1994, where INSIDER is the 
dependent variable, the DIV% * POST coefficient is —0.04. For Equation (2), when NONEXEC is 
the dependent variable, the DIV% * POST coefficient is 0.23. For Equation (3), when MF is the 
dependent variable, the DIV% * POST coefficient is 0.01. For Equation (4), for the single firm 
response equation where the dependent variable is DIV%, the INSIDER ж POST coefficient is 
0.15, the NONEXEC * POST coefficient is 0.03, and the МР * POST coefficient is 0.02. Unlike 
the 2003 results for which four of the six coefficients are significantly positive, none of these six 
coefficients are significantly different from zero. However, at the same time, none of the 1994 
coefficients are significantly different from their 2003 counterparts, suggesting that there may be 
nothing unique about 2003. Results for the remaining years are tabulated, and summary statistics 
for the coefficients from the non-event years are presented at the bottom of Table 5. 

The findings confirm the inferences drawn from the primary tests in Table 4. The 2003 
coefficient on DIV% ж POST, when INSIDER is the dependent variable, exceeds its counterpart in 
every year except 2001. Using a t-test (sign test), we can reject at the 0.02 (0.07) level the null 
hypothesis that the 2003 coefficient would be randomly selected from a distribution formed by the 
eight coefficients from 1994 through 2001. Finding that the 2003 association was different pro- 
vides further confirmation that insider holdings, following JGTRRA, were increasing in the divi- 
dend percentage of total payout. As further confirmation of the Table 4 results, we continue to find 
no indication of tax clientele movements by other individuals or mutual funds. 

Turning to the manager responses to JGTRRA, we find that the 2003 INSIDER * POST 
coefficient always exceeds its counterparts in every year from 1994 to 2001 and significantly so 
from 1997 to 2000. The 2003 NONEXEC * POST coefficient is greater than its counterparts in 
every year except 1996, and is statistically significantly larger in two of the eight years. The 2003 
MF * POST coefficient always exceeds its non-event counterparts, though not significantly. How- 
ever, for all three 2003 coefficients, we can reject at the 1 percent level, using a t-test, that the 
2003 coefficient would be randomly drawn from a distribution of the estimated coefficients from 


25 We stop with 2001 because any test of 2002 would result in the inclusicn of some of the period following enactment of 
the JGTRRA, invalidating its use as a non-event, comparison period. 
26 Tnferences are qualitatively unaltered if the Dividend Payers and Repurchasers sample is used. 
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the non-event years. We can similarly reject at the 1 percent (7 percent) level, using the sign test, 
for the INSIDER * POST and MF * POST (NONEXEC * POST) coefficients. Together, these 
results support our earlier conclusions that managers of firms with disproportionately large indi- 
vidual investor holdings modified their distribution policy more than did other firms. These sen- 
sitivity tests provide comfort about our earlier inferences, providing evidence that the associations 
in 2003 between distribution policy and individual ownership measures differ from the relations 
between those variables in the prior decade. 

There is a caveat for this robustness check. The capital gains tax rate fell from 28 percent to 
20 percent in 1997 without any change in the dividend tax rate. Using the same logic developed 
in this study, we would have predicted a shift from dividends to repurchases following that rate 
reduction, albeit of a lesser extent because the rate change was more modest. If that change in the 
distribution mix did occur, then this sensitivity test using non-event periods would bias in favor of 
our finding that the 2003 coefficients were more positive than those in previous years. The reason 
is that the tax incentives facing individual investors after the rate reductions in 1997 would have 
called for the mix of distributions to shift toward repurchases and away from dividends. In other 
words, 1997 was not really a non-event year. 

Therefore, if firms responded accordingly, then we would expect the coefficients on the 
individual tax measures for 1997 and perhaps 1998 to be negative. Consistent with firms modi- 
fying their payout policy if individuals held disproportionately large interests, the coefficients on 
INSIDER * POST, NONEXEC * POST, and МЕ ж POST (when DIV% is the dependent variable) 
turn negative for the first time in 1997. Thus, firm responses to changing tax incentives following 
the 1997 Act may have biased in favor of our rejecting the null hypothesis that the distribution mix 
did not change after 2003. However, when we exclude the 1997 coefficients in our comparison of 
the 2003 coefficients to those in the other non-event periods, we still find that the 2003 coefficient 
estimate remains significantly different from those in the other years. Furthermore, if managers did 
indeed respond to the 1997 tax cuts by increasing the repurchase portion of their distributions, then 
these results provide further evidence that managers consider the personal taxes of their share- 
holders in issuing dividends and repurchasing shares. 


How Quickly Did Investors and Managers Respond to JGTRRA? 


All of the tests reported to this point aggregate dividends and repurchases over the eight 
quarters before enactment and the eight quarters after enactment, creating two observations (be- 
fore and after JGTRRA) for each firm. As discussed above, the reason for aggregation is that, 
unlike regular, quarterly dividends, repurchases are irregular events, which can lead to highly 
volatile quarterly measures ої DIV%. In this section, we relax this restriction and treat each quarter 
as a different observation, resulting in 16 observations for each firm. The reason we shift to 
quarterly measures here is to enable us to pinpoint the time when investors and managers re- 
sponded to JGTRRA. 

To get GMM quarterly coefficient estimates for the same key six variables examined through- 
out this study, we suppress the intercept and include a categorical variable for each quarter from 
the earliest quarter (eight quarters before enactment—the quarter including May 2001) to the most 
recent quarter (eight quarters after enactment—the quarter including May 2005). We then interact 
the categorical variable for each of the 16 quarters with the six variables of interest. 

Table 6 reports 16 quarters of coefficient estimates for the six key variables for the Dividend 
Payers sample. We find little change across the quarters for the three tax clientele regressions. That 
is not surprising for estimates when NONEXEC or MF is the dependent variable because we find 
no evidence anywhere in this study that suggests that non-executive individuals or mutual fund 
investors rebalanced their portfolios in favor of stocks that pay a larger portion of their payouts 
through dividends. However, because we find some support for clientele effects among insiders, it 
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is a bit surprising that we cannot detect any cross-quarter changes in coefficient estimates. 

Conversely, for manager responses to JGTRRA, we find a sharp increase in the quarterly 
coefficient estimates during the second quarter following passage, which would be the quarter 
ending December 31, 2003, for most companies (1.е., those with March, June, September, and 
December year-ends). In eight of the nine preceding quarters, including the quarter immediately 
following the quarter of enactment (quarter +1), the INSIDER coefficient is negative. Beginning 
with quarter +2, the INSIDER coefficient is positive in all but one quarter, and is significantly 
positive in four of the seven quarters. Likewise, the quarterly pattern for the NONEXEC coeffi- 
cients shows a similar break in quarter +2. The NONEXEC coefficient is negative for all nine 
quarters before quarter +2. The coefficient turns positive in that quarter and remains positive in 
four of the remaining six quarters. Similarly, the MF coefficient is negative in six of the eight 
pre-enactment quarters and negative only once thereafter. The MF coefficient is actually positive | 
in quarter +1, but its coefficient triples from quarter +1 to quarter +2. 

We conclude from the quarterly findings that companies with disproportionately large indi- 
vidual ownership began to substantially adjust their payout policy in response to JGTRRA during 
the last quarter of 2003 and the changes were sustained for the remainder of the investigation 
period. A delay of one quarter before firms responded to the tax incentives of individual investors 
is consistent with firms being unable or unwilling to respond to the new tax rates in the quarter 
immediately following passage. Perhaps they had already made their dividend and repurchase 
decisions for the third quarter of 2003 by May 2003. However, by the last quarter of 2003, it 
appears that firms held disproportionately by individuals were beginning to shift at the margin 
from repurchases to dividends. 

This delay of one quarter may partially explain Brown et al.’s (2007) inability to find non- 
dividend initiators substituting dividends for share repurchases in 2003 and Aboody and Kasznik’s 
(2008) failure to link repurchases, stock options, and individual ownership. Both studies treat all 
dividends paid and shares repurchased in 2003 as post-enactment payouts.”” If few firms were 
adjusting their distribution policy before the last quarter of 2003 and, in fact, firms were making 
distribution choices before then, as though prior tax law applied (as implied by the negative 
coefficients), then it is understandable that their tests would have struggled to detect any move- 
ment from repurchases to dividends. 


Ү, CLOSING REMARKS 

This study extends our understanding of the effects of shareholder taxes on firm payout policy 
by estimating a system of equations to quantify the investor and managerial responses to the 
unprecedented 2003 cuts in dividend and capital gains tax rates. We hypothesize that, in response 
to the legislation, individual investors (the only ones affected by the rate reductions) rebalanced 
their portfolios to increase their dividend income, while firms boosted the portion of their profits 
that they returned as dividends. Comparing firm-level individual ownership and dividend- 
repurchase mix, before and after 2003, we find evidence consistent with directors and officers 


2 Our quarterly results, indicating no response to JGTRRA before the second half of 2003 and very little until the last 
quarter of 2003, raise concerns about the pre/post JGTRRA classifications in Brown et al. (2007), Chetty and Saez 
(2006, 2005), and Aboody and Kasznik (2008). Those studies treat all dividends paid and shares repurchased in 2003 as 
responses to JGTRRA, even those declared months before the May 23rd passage of the legislation, (e.g., Microsoft’s 
initial dividend announcement on January 7, 2003). By using the day that the dividends were paid, they even include 
some dividends declared in 2002, well Бегоге President Bush ever mentioned possible dividend tax relief in January 
2003. Brav et al. (2008) state that it is implausible chat firms were so clairvoyant that they declared tax-motivated 
dividends months before passage. Among other factors, President Bush's initial comments were vague and preliminary, 
and weeks passed before details of his proposal emerged. Furthermore, passage of the highly controversial legislation 
was uncertain until Vice-President Cheney cast a tie-breaking vote in the U.S. Senate to gain passage of the legislation. 
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rebalancing their portfolios, but not other individual investors. We also find evidence consistent 
with those firms with large individual ownership boosting the dividend portion of their total 
payouts. The payout adjustments began a few months following enactment of the tax cuts. The 
regression coefficient estimates imply that both the portfolio rebalancing and distribution policy 
changes following JGTRRA are economically significant. 

To our knowledge, this is the first study to evaluate investor and managerial responses to 
JGTRRA using simultaneous equations. We find that the results are much stronger than they would 
have been had we used standard OLS estimates of separate regressions, consistent with investors 
and managers acting concurrently. Both statistical and economic significance increased substan- 
tially using GMM. Furthermore, this is the most comprehensive analysis of the JGTRRA to date. 
Other studies focus on either investor responses or managerial responses; we study both. We 
analyze the set of firms that pay almost all dividends (і.е., non-dividend initiators), three different 
groups of individual investors (insiders, non-executive individuals, and mutual funds), and the 16 
quarters centered on passage of the legislation. The methodological and sample enhancements 
enable us to construct a more powerful test of the association between shareholder taxes and 
corporate distribution policy. 

Finally, our results suggest that additional research is needed to understand the role of insiders 
in the interaction of shareholder taxes and distribution policies. We find that insiders are the only 
individuals who rebalanced their portfolio in response to the rate changes. We also find that firms 
were particularly responsive to the changed tax incentives if directors and officers held large 
positions. These results complement Brown et al.’s (2007) finding that insiders played key roles in 
initiating dividends at the expense of share repurchases in 2003, which suggests that corporate 
governance needs to be introduced into analyses of the shareholder tax-payout choice. Questions 
that future research could explore include the following: To what extent are insiders motivated by 
their own personal tax considerations when setting distribution policy? How do the tax incentives 
of insiders affect the returns to other investors? From a distribution policy perspective, did other 
individual investors benefit from holding stocks in companies with high insider interests while 
non-individual shareholdings suffered? To what extent does compensation of executives with 
stock affect a firm’s distribution policy? To what extent do changes in shareholder taxes affect a 
firm’s value differently, depending on the extent of its insider holdings? We look forward to 
answers to these and similar questions in future studies. 
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ABSTRACT: Auditors are required to understand dynamic business environments as 
part of the performance of analytical procedures. Prior evidence, though, suggests that 
auditors have difficulty understanding such environments. This study reports an experi- 
mental investigation of techniques that help auditors to identify Incorrect management 
representations by developing expectations that are both accurate and adaptive to 
changing business conditions. My predictions suggest that analyzing a dynamic client 
environment through a systems perspective enhances Information-processing ability, 
which then improves both evidence discrimination and the assimilation of new audit 
evidence. Results reveal that participants taking a hollstic systems-based view of a 
cllent environment develop more coherently organized mental models that increase 
their likelihood of identifying management representations that are inconsistent with 
industry evidence. Furthermore, these participants more efficiently use their 
information-processing abllity, thereby improving assimilation of newly leamed өм- 
dence to understand how changing business conditions affect thelr Initial expectations. 


Keywords: analytical procedures; knowledge organization; leaming; mental models. 
Data Availability: Contact the author for data avallability. 


I. INTRODUCTION 
hen performing analytical procedures, auditors should treat any discrepancies between 
W pre-developed expectaticns and management representations as indicators of height- 
ened misstatement risk (Bell et al. 2005). Auditors develop expectations by understand- 
ing how a client’s economic environment (entity-level evidence) relates to that client’s financial 
statements (AICPA 2006; Knechel 2007). Because auditors compare their expectations about how 
financial and nonfinancial measures change over time to management representations of the same, 
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the complex and evolving nature of today’s business environment increases auditors’ needs to 
develop expectations that are not only accurate, but also adaptable to changing business conditions 
(Bell et al. 1997). The purpose of this study is to investigate techniques for examining entity-level 
evidence that improve analytical procedures performance in both corroborating management rep- 
resentations and accurately updating expectations when business conditions change. 

While current audit approaches and standards require auditors to understand complex business 
environments, laboratory results suggest that even experienced decision-makers have difficulty 
learning in, and making optimal decisions about, such settings (Sterman 1989; Diehl and Sterman 
1995). Consistently, auditors, particularly inexperienced auditors, have difficulty linking entity- 
level evidence to management representations (Curtis and Turley 2007), and this inability is often 
a significant component of audit failure (SEC 2003; Peecher et al. 2007). At least a part of this 
difficulty is likely due to human cognitive limitations inhibiting the auditor’s ability to build 
accurate and sufficiently detailed mental models that take into account how economic events flow 
down to affect the client's financial statements (Bell et al. 2002, 2005). Of particular difficulty is 
understanding the nonlinear movement of entity-level evidence. Since auditors develop expecta- 
tions by cognitively simulating the time-series behavior of how entity-level evidence relates to the 
financial statements, these cognitive limitations hinder auditors’ ability to develop accurate expec- 
tations that can be used as a benchmark for evaluating management representations. 

Auditing and management theorists suggest one way to improve performance is to train 
auditors to adopt a systems-based perspective when analyzing a client’s economic environment 
(Bell et al. 1997; Sterman 2000; Peecher et al. 2007). This systems view involves developing, and 
organizing, a mental model of the client environment by holistically evaluating causal relation- 
ships and how they change over time. Drawing on psychology theory, I predict that organizing a 
mental model using a systems view will increase information-processing ability, thereby improv- 
ing the accessibility of auditor’s pre-existing knowledge of entity-level evidence. This enhanced 
processing should lead to more accurate and adaptable nonlinear expectations. Without a systems 
view, cognitive limitations are expected to force auditors into a reductionist view that considers 
environment components in isolation and simulates more predictable, and often incorrect, linear 
expectations. 

Prior accounting research provides evidence that using systems-based concepts such as holis- 
tic templates (Choy and King 2005) and understanding circular causality (Hecht 2010) can im- 
prove decisions. However, these studies do not place the decision-maker in a complex setting that 
strains cognitive-processing resources and requires using pre-existing knowledge to simulate out- 
comes for evaluating financial information. Such an examination is needed since practitioners note 
that understanding and using knowledge from complex business environments is an essential step 
in solving many ambiguous auditing problems (PwC 2003). Also, no study shows how auditors 
learn, build, and use systems-consistent mental models of their clients or the cognitive factors and 
processes that moderate performance improvements. 

To test this prediction, I conduct a laboratory experiment in which novices perform an ana- 
lytical procedures task, requiring them to learn about a dynamically complex client environment. 
The combination of novices and a complex environment allows me to test the theories underlying 
this study by placing participants in a setting that would overwhelm their cognitive-processing 
ability unless they have a well-organized mental model of the client. I prompt participants to 
analyze the client using either a systems-based perspective, to enable a more coherent organization 


Mental models are holistic cognitive representations of a specific phenomenon and are constructed based on an indi- 
vidual's understanding of how pieces of information link and interrelate (Rickheit and Sichelschmidt 1999; Bell et al. 
2002). Mental models are comprised of one or many knowledge structures, and the model’s level of detail and accuracy 
is subject to both the individual’s understanding of the phenomena and cognitive-processing constraints. 
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of entity-level evidence, or a reductionist-based perspective, leaving the participants to organize 
such evidence intuitively. I then manipulate whether management provides an explanation for an 
account fluctuation that is either relatively more consistent or inconsistent with the entity-level 
evidence. Finally, to learn how auditors update their mental models for changing conditions, I 
introduce new entity-level evidence that should change the nature of the auditors’ existing expec- 
tations. 

Consistent with predictions, I find that systems participants provide more coherently orga- 
nized mental simulations of the environment’s causal relations. They perform these improved 
simulations without increasing their cognitive effort, suggesting that the systems view helps par- 
ticipants overcome the previously mentioned cognitive limitations by enhancing their information- 
processing ability (Clark et al. 2006). Systems participants are more likely than reductionist 
participants to incorporate key nonlinear industry attributes into their expectations, and are also 
more likely to identify management representations that are inconsistent with entity-level evi- 
dence. Finally, the systems participants who have coherently organized mental models are better 
able to more accurately update their expectations when presented with new entity-level evidence. 

This study extends the existing literature on improving knowledge acquisition and perfor- 
mance during analytical procedures. Prior studies show that auditors’ knowledge acquisition for 
analytical procedures is improved through feedback or increasing cognitive effort (Bonner and 
Walker 1994; Earley 2001; Moreno et al. 2007). However, in an audit setting, feedback is not 
always readily available and increasing cognitive effort is not always effective due to the com- 
plexity of some analytical procedures tasks. Consequently, audit firms and educators have ex- 
plored methods to improve auditor mental models, but these methods expect auditors to make the 
essential causal linkages and simulations of the client environment that prior laboratory evidence 
suggests is difficult to do without assistance (Bell et al. 1997; Bell et al. 2002). Indeed, previous 
studies show that such approaches can actually introduce biases into the judgment process 
(O’Donnell and Shultz 2005). I demonstrate that taking a systems view improves participants’ 
analytical procedures performance both initially (i.e., when gathering entity-level evidence and 
analyzing management representations), and when presented with new, incoming evidence. 

Furthermore, I provide insight into how auditors build mental models by examining the 
factors that moderate the relationship between the organization (structure) of auditors' entity-level 
knowledge and their information-processing ability. This relationship accounts for substantial 
variation in task performance. While Borthick et al. (2006) find that better-developed knowledge 
structures improve performance, there are few studies examining how auditors build these struc- 
tures and use them to form judgments. In addition, theoretical learning frameworks suggest that 
knowledge structure and processing ability are related (Kyllonen and Woltz 1989; Libby 1995), 
but to date, there is little empirical evidence of the true nature of this association (Ceci et al. 2003). 
I find that coberently organized knowledge structures enhance information-processing ability, 
which then improves the likelihood of accurately updating existing knowledge structures with new 
entity-level evidence. Understanding such a relationship allows audit firms and educators to tailor 
instruction and training to exploit such learning improvements. 

In Section II, I describe relevant research and develop hypotheses. Section III presents the 
experimental method used to test the hypotheses, and Section IV discusses the results of the 
experiment. Section V concludes. 


П. THEORY AND HYPOTHESES 
Background on Learning and Knowledge Acquisition 
Understanding how auditors learn and update expectations using a systems view requires 
examining the cognitive factors that facilitate learning entity-level evidence. Learning is the ac- 
quisition and organization of knowledge through working memory (Bonner 2007). Working 
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memory, the cognitive system for holding and manipulating information, is the intersection be- 
tween long-term memory and information received through the senses, and is a significant com- 
ponent of intellectual and problem-solving ability (Baddeley 1998: Conway et al. 2003). Knowl- 
edge is information that is stored and retrieved from long-term memory (Libby 1995; Bonner 
2007), and is comprised of content (specific information pieces) and the organization, or structure, 
of that content. When forming judgments, individuals retrieve knowledge from long-term memory 
and manipulate it within working memory. 

Humans have difficulty learning and performing complex tasks due to limitations within 
working memory. Individuals can cognitively process only a few information items at a time, and 
their ability to hold and manipulate items in working memory rapidly decreases as the items’ 
interactivity increases (Engle and Kane 2004). In other words, information with numerous causal 
linkages or dependences is difficult to learn because individuals cannot hold both the items and 
their interactions within working memory. One way to increase working memory processing 
capacity is by developing schemas, or clusters of information (Clark et al. 2006). Schemas form by 
connecting pieces of information into a coherent and meaningful structure, sometimes because of 
considering a single example problem (Ahn et al. 1992). Schemas are held in working memory as 
a single item. Organizing knowledge in long-term memory into schemas eases the strain on 
working memory by reducing cognitive load and therefore substantially increases cognitive- 
processing ability.” Improved schema development leads to better-organized knowledge structures, 
which are typically associated with improved decision-making performance (Sternberg 1997). 

Individuals learn by elaborating, or mentally simulating, how new information relates to 
existing knowledge in long-term memory (Bonner 2007). Mental simulations are cognitive герге- 
sentations of past or potential future events (Kahneman and Tversky 1982). Individuals simulate 
both by cognitively manipulating how new or existing variables affect an existing mental model 
and by examining the manipulation’s resulting output (Klein 1998; Sanna 2000). Therefore, fa- 
cilitating the simulation process through mental model development is an important element of 
learning. Improving auditors’ analytical procedures performance requires enhancing their likeli- 
hood of accurately simulating expectations of entity-level evidence based on information in their 
long-term memory. 


Using a Systems Perspective to Improve Auditors’ Mental Models 

To develop expectations about a client’s financial position, auditors should construct a suffi- 
ciently detailed and accurate mental model of how a client interacts with its business environment 
(Bell et al. 1997; Bell et al. 2002, 2005). The types of business environments modeled by auditors 
are complex systems that include features, such as feedback loops and time delays, that can 
produce nonlinear movement of key system components (Sterman 2000). For example, the time- 
series movements of entity-level evidence indicators, such as commodity prices and inventory 
balances, are the culmination of multiple causal effects having different strengths over time. 
Making accurate inferences and expectations about such indicators requires that decision-makers 
have mental models that include a coherent sequence of causal relations allowing the auditor to 
simulate the indicators’ movements. 

Decision-makers have difficulty learning in, and modeling, complex business environments 
with nonlinear behavior even when they are presented with all necessary information (Sterman 
1989; Sweeny and Sterman 2000). This difficulty results from working memory limitations that 
prevent individuals from combining and correctly organizing the necessary causal relations caus- 


? Cognitive load is the burden that performing a task places on а decision-maker’s cognitive system, and increases by 
specific aspects of a task, such as complexity (Paas et al. 2003). 
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ing such behavior (Sterman 1989}. Working memory constraints hinder retrieving and using 
knowledge in long-term memory while performing mental simulations. Consequently, when 
placed in an overwhelming business setting, decision-makers often resort to a heuristic-driven 
reductionist perspective that considers the system components in isolation and bases decisions on 
more predictable linear relationships (Sterman 2000). Reductionist-based thinking is a common 
cause of judgment error in complex business settings (Reason 1990; Sterman 2000). 

Auditors can reduce reliance on reductionist thinking by taking a systems-based perspective 
of the client environment (Richmond 1994; Peecher et al. 2007). Bell et al. (1997, 14) describes a 
systems view as one that “emphasizes basic principles of organization—how the parts аге inter- 
related and coordinated into a unified whole.” Such a perspective entails focusing on the causal 
relationships among system components as well as the nonlinear behavior resulting from these 
system interactions (Richmond 1994; Sterman 2000).? Auditors taking a systems perspective 
should more coherently organize the environment’s causal relations in their mental models, 
thereby enhancing their ability to develop causal schemas. Therefore, auditors adopting a systems- 
thinking perspective, and developing schemas of the various environment causal relations, are 
likely to process more information concurrently in working memory and thereby overcome some 
of the aforementioned cognitive constraints.’ These auditors’ mental model components should 
show a logical causal sequence that allows them to better understand the dynamic nonlinear 
features of a client environment. This improved coherent organization allows the auditor to more 
easily access knowledge from long-term memory and simulate how entity-level evidence affects 
the client. Because these auditors can better mentally simulate how economic events flow down to 
affect the client, they will more likely develop accurate expectations of industry dynamics. 


НІ: Participants using a systems perspective will develop expectations regarding entity-level 
evidence that are more accurate than will participants using a reductionist perspective. 


The following hypothesis examines the cause of these underlying improvements by investi- 
gating whether a systems perspective does in fact reduce auditors’ cognitive-processing constraints 
when performing mental simulations. Prior research shows that prompting auditors to perform 
simulation-like tasks can improve performance (Earley 2001; Moreno et al. 2007), but this re- 
search does not examine how to improve performance when working memory is overwhelmed and 
increasing cognitive effort is ineffective. As noted earlier, a systems view should help auditors 
organize information into schemas, thereby expanding working memory capacity. To examine the 
improvements resulting from expanded working memory, researchers measure the efficiency of 


Systems thinking is a byproduct of system dynamics, which is an interdisciplinary field that uses mathematical computer 

models to simulate complex quantitative problems that decision-makers cannot perform without such aid (e.g., hurricane 
forecasting, predicting spread of viruses). System dynamics views the world as a web of interacting complex relation- 
ships including feedback loops and time delays. Systems theorists have tried to improve individual decision-making by 
training people to adopt the same principles that make the system dynamic models successful (ie., understanding 
multiple causality, feedback loops, etc.). This training typically involves either building computer simulations or devel- 
oping skill sets that can recognize dynamic interactions. To date, there is limited laboratory evidence showing either the 
effectiveness of such techniques or an е сети way to implement such methods. 
A system is a “collection of parts that interact to function as a whole” (Bell et al. 1997, 14). As described in this paper, 
a systems view understands phenomena in terms of these interacting and interdependent parts. In contrast, a reductionist 
view understands phenomena as isolated individual components. I consider these views as parts of a continuum with 
system and reductionist endpoints. As individuals construct mental models to guide their viewpoints, they will fall 
somewhere on this continuum. Information-processing ability is one factor that moderates placement on this continuum 
and a lack of such ability pushes an individual more toward a reductionist perspective. In this study, I use the term 
“systems view” as a viewpoint that is relatively more toward the systems endpoint, and “reductionist view” as one that 
is relatively more toward the reductionist endpoint. 
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information processing (Paas et al. 2003). Processing efficiency is a byproduct of working- 
memory capacity and considers both performance and exerted mental effort concurrently. Infor- 
mation processing is more efficient if increases in task performance are higher than expected 
considering mental effort level, or if mental effort is lower than expected considering performance. 


H2: Participants taking a systems perspective will process information in working memory 
more efficiently when performing mental simulations than will participants taking a 
reductionist perspective. 


Improving Performance: Evaluating Management Representations 


During analytical procedures, auditors compare client-provided values, either account bal- 
ances or other amounts provided in explanations, to quantitative expectations of entity-level evi- 
dence (Koonce 1993; Hirst and Koonce 1996). Psychology theory suggests that when a client 
provides an amount, auditors will simulate a range of potential values to compare against that 
amount (Mussweiler and Strack 2001; Epley and Gilovich 2006). If the client-provided amount is 
inconsistent with the auditor’s simulated range, then the auditor discounts the representation’s 
credibility. Consequently, accurately simulating this range of potential values increases the likeli- 
hood of identifying management representations that are inconsistent with entity-level evidence. 

Prior literature examines this expectation violation phenomenon, but these studies either (1) 
rely on the auditors’ prior experience to understand entity-level evidence (Choo and Trotman 
1991; Earley 2002), or (2) provide participants with a means to examine concurrently entity-level 
evidence along with client-provided evidence (Earley 2001; Vera-Munoz et al. 2007). To date, no 
Studies separately examine the mental model and expectation development process and how they 
affect subsequent evaluation of client representations. Distinguishing between expectation devel- 
opment and evidence evaluation is important because auditors often rely on their prior knowledge 
for corroborating client-provided evidence, and extant literature shows that auditors can be influ- 
enced if they cannot discredit such evidence (Kennedy 1995; Earley et al. 2008). 

The aforementioned theory suggests that an ordinal interaction exists between client analysis 
perspective and management representation consistency with entity-level evidence. Systems audi- 
tors will be better able to mentally simulate a range of potential values for entity evidence and 
therefore distinguish between management representations that are consistent with such evidence 
from those that are not. These auditors are more likely to provide lower credibility assessments for 
inconsistent representations. Furthermore, when management representations are inconsistent with 
entity evidence, systems auditors should be more likely to recognize the inconsistency and provide 
lower credibility ratings than reductionist auditors. The following hypothesis sets forth the pre- 
diction: 


H3: Participants will evaluate the credibility of management representations at the lowest 
level when they use a systems perspective of entity-level evidence and when they re- 
ceive management representations that are inconsistent with that evidence; participants 
will evaluate the credibility of management representations at a higher level under all 
other combinations of analysis perspective and evidence consistency. || 


Improving Performance: Updating Expectations of Entity-Level Evidence 

In addition to improving processing ability, a coherently organized mental model provides a 
structure for assimilating and retrieving incoming information (Brewer and Nakamura 1984; 
Sweller et al. 1998). In an audit context, Bonner et al. (1997) find that relevant knowledge 
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structures in place prior to experiencing events aid in learning from those events. I extend this 
result by examining one potential component of these learning improvements, namely that orga- 
` nizing information reduces the constraints on working memory capacity and, hence, improves an 
individual’s understanding of how to accurately integrate new information to revise a knowledge 
structure.’ While theoretical frameworks suggest that knowledge structure and working memory 
interact in the learning process (КуПопеп and Woltz 1989; Bonner 2007), there is little empirical 
evidence showing how this process occurs. This study tests how organized knowledge improves 
mental simulation in working memory allowing for improved mental model building. 

A better-developed mental model established through a systems-thinking perspective likely 
improves auditors’ learning in a dynamic business setting for two reasons. First, the resulting 
improved simulation coherence should allow the decision-maker to effectively simulate how the 
various system components can affect each other in a logical sequence. Second, schemas of the 
causal interactions will reduce cognitive load and increase the likelihood of understanding how the 
new information affects the dynamic nonlinearities resulting from component interactions. Under- 
standing the environments’ dynamics allows decision-makers to manipulate how the new infor- 
mation affects the essential causal relations within their existing mental models. 

Auditors using a better-organized mental model with a systems-based perspective will be 
more likely to understand how new evidence affects the existing cause-and-effect relationships 
and resulting nonlinearities within their mental model. This understanding makes these auditors 
more likely to change their existing inferences regarding such evidence when they receive new, 
contradictory information (Bell et al. 2005). 


H4: When participants receive new entity-level evidence, those who have relatively more 
coherently organized mental models and take a systems perspective will most accurately 
update their mental models to modify existing expectations regarding entity-level evi- 
dence compared to all other participants. 


Ш. METHOD 

Participants 

The participants are 61 accounting students from various junior- and senior-level accounting 
courses at a large state university, randomly assigned to experimental conditions. The theories 
underlying this study apply to both expert and novice auditors, but for present purposes, using 
novices has distinct advantages. First, using student participants shows that less experienced 
auditor subjects garner process improvements through systems-thinking, contrary to recent critical 
claims that only experienced auditors can perform the complex business environment analysis 
required by professional standards (AICPA 2006; Curtis and Turley 2007). 

Second, this subject pool affords.a more powerful test of theory than a professional subject 
pool (Smith 1989; Friedman and Sunder 1994). Experimental tasks must strain working memory 
to reveal the differences between conditions (Tindall-Ford et al. 1997). The task I employ is more 


My predictions suggest that improved working memary processing ability is one component of mental model updating. 
I do not preclude that improved knowledge structures would benefit updating even if one held processing ability 
constant (e.g, an impoverished model might not have developed the links essential to understanding how the new 
information would affect the existing model) My experimental design addresses this issue by examining whether 
task-performance improvements are due to knowledge content differences alone and/or include an aspect of processing 
ability. 
Smith (1989) conjectures that the primarv purpose of laboratory experiments is to test theory and researchers should 
design experimental procedures to maximize this goal. Furthermore, Friedman and Sunder (1994, 21) state that experi- 
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likely to strain working memory, and to do so to a greater degree, for nonprofessional, student 
subjects than audit professionals because efficient use of working memory resources is a common 
attribute of expertise (Sternberg 1997). 

Also, because my experiment involves training in systems-thinking and the process of learn- 
ing entity-level evidence, using auditor participants with variable, specialized, pre-developed rou- 
tines for or disposition toward analyzing such information would not facilitate a clean mental 
model manipulation. Using student participants with less variable, less specialized, and more 
malleable knowledge structures, relative to professionals, allows me to effectively control and 
manipulate mental model development (Libby et al. 2002). Holding constant the participants’ 
pre-existing knowledge structures by using nonprofessional subjects—an approach consistent with 
previous research examining learning environments (Bonner and Walker 1994; Bonner et al. 
1997)—is conducive to identifying how knowledge organization and information-processing abil- 
ity interact to improve performance.’ 


Task 

The instrument contains tasks (see Figure 1) representing analytical procedures performance 
from initial evidence gathering and mental model building to subsequent management represen- 
tation evaluation (Koonce 1993; Bell et al. 2005). First, participants receive a tutorial describing 
how auditors use entity-level information for developing client mental models. Participants then 


FIGURE 1 
Overview of Experimental Procedures 


Тор: Systems-Based Condition 
Bottom: Reductlonist-Based Condition 





Packet A Packet B Packet C 
Examine Graph Stock 
Stock & Stock & Flow—p Bobavior for 
Training Model DDG & Feed 
Road Con Graph Stock. Meatal Analytical Poste 
Industry &- Bebavior for —p> Simulation ----- Review — —- Experiment 
Client Beef Prices Таж Task Questionnaire 
Overview Perform 
Business Bushmen 
Risk Training Risk Analysis 


This figure represents the temporal sequence of the experimental materials that the participants received. Each 
Packet was taken from the participants before they received the additional items (although they were allowed to 
use the training tutorial while completing Packet A). 


mental design should “sharpen the effects of focus variables and minimize blurring due to nuisance variables.” Con- 
trolling for pre-existing knowledge by using nonprofessional subjects supports this objective by minimizing any undue 
effects this knowledge could have on my dependent variables. 

In addition to these theory-testing advantages, this research pool is also sufficient for my study; auditor participants are 
unnecessary because no current theoretical or empirical evidence suggests that pre-existing knowledge would interact 
with the manipulated factors in such a way as to change the nature of the independent variables’ predicted influence on 
the dependent measures (Peecher and Solomon 2001). While the pre-existing knowledge structures of professional 
auditors could reduce the magnitude of the predicted effects by decreasing strain on working memory, there is no reason 
to suspect that these structures could also cause a predicted directional effect to reverse (i.e., a disordinal interaction). 
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opened Packet A, which contained materials for a fictitious hamburger food chain. These mate- 
rials include an industry and company overview characterizing current conditions in the beef 
industry (see Exhibit 1). The case describes how increased ethanol production causes the price of 
corn-based feed for cattle to increase, subsequently increasing beef prices. However, another key 
point in the case is that the use of an ethanol byproduct, called DDG, for cattle feed causes beef 
producers’ feed costs to decline after June of the period under examination because DDG costs 
only 60 percent of typical corn-based feed. See Figure 2 for a computer simulation of beef price 
behavior based on the facts in the case. The case states that the company's success ties heavily to 
the beef industry and instructs participants to develop expectations by inferring the time-series 
behavior of beef prices for the year under review. 

After turning in Packet A, participants applied their existing mental models to perform a 
mental simulation task in Packet B by writing a paragraph describing how an abrupt change in 
ethanol prices would affect the company's profits. After turning in Packet B, participants con- 
ducted an analytical procedures task in Packet C. This task shows that the company's profitability 
declined over the past year and provides both a gross profit analysis and an explanation from 
management targeting rising beef prices as the reason for the decline. The provided explanation is, 
however, incomplete, as beef prices did not steadily rise during the year, but declined after June 
because of implementing the DDG byproduct as feed. The participants assessed the credibility of 
management's explanation of the profit decline and provided their own estimates of year-end beef 
prices. Finally, participants received an additional paragraph that explained new facts regarding 
entity-level evidence and provided updated beef-price estimates considering those facts. 


Independent Variables 


The experiment employs a 2 X 2 between-subjects design with two manipulated independent 
variables: (1) industry analysis perspective (systems-based or reductionist-based) and, (2) the 
degree to which management representations regarding year-end beef prices are consistent with 
industry evidence (consistent at $125, or inconsistent at $165). | | 

The industry perspective variable provides conditions that aid participants in either develop- 
ing a systems view or resorting to a reductionist view. The systems-based condition provides 
training and materials helping participants to develop a mental model with a coherent structure of 
the causal relations in the case materials. Existing research does not provide a systems-based 
manipulation that actually improves quantifiable aspects of auditors’ mental models. Therefore, 
my systems manipulation uses stocks and flows, which systems theorists state is the most effective 
way to develop accurate mental models.” Business settings are comprised of multiple stocks and 
flows, and these items should help individuals simulate the nonlinear movement of system com- 
ponents. However, even experienced individuals have difficultly understanding these concepts 
without extensive training (Sterman 2000). Therefore, the systems participants in my study used 
methods that are difficult to learn without such training. Consequently, because my study utilizes 
a brief tutorial, I rely on three commonly used techniques by systems trainers to develop mental 


* The name of the hamburger food chain used in the experimental materials is suppressed due to the fact, unknown at the 
time of the experiment, that a real-life hamburger food chain with a similar name exists. That chain operates in a 
different part of the U.S. than the university where the participants were recruited, so I have no reason to believe that 
any participants associated the materials with the actual chain. 

Stocks are accumulations within a system, while flows represent the stock increases and decreases over time. To 
illustrate with a commonly used example, the water in a bathtub would represent a stock, whereas the water coming 
through the faucet and leaving through the drain are flows (Sterman 2000). A system's dynamic properties (feedback 
loops, time delays) are a byproduct of th» interactions among and behavior of stocks and flows. 
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EXHIBIT 1 
Beef Industry and Company Overview 


• Beef prices have increased in recent years. The price of beef as of January 1, 2006 was $100 рег 
hundredweight (cwt) of cattle and reached record levels later in the year. 
• The recent increase in beef demand is a sharp contrast to prior periods as the demand for beef was on 
the decline for the previous 20 years. Consumers moved away from beef to alternatives, such as poultry, 
that are less expensive and considered healthier. For example, in 1975 beef accounted for roughly 49 
percent of meat products consumption but by 1997 that share had fallen to 32 percent. This decline is 
due to the previously mentioned health concerns about beef as well as the increased prices of beef 
compared to cheaper alternatives such as pork and poultry. In addition, the beef industry has historically 
lagged behind both the poultry and pork industry in developing pre-packaged, pre-cooked products that 
allow consumers to reduce cooking time. 
In the past decade, health concerns by customers have made the industry focus on “natural” beef, which 
is becoming an expectation of a large and growing portion of the population. For example, consumers in 
some markets are willing to pay as much as 25 percent more for a round roast if it has a label 
containing the word “natural.” 
Profitability in the beef industry is declining due to rising production costs. Unfortunately, the long life 
cycle of cattle means that beef producers must make production decisions years in advance in order to 
respond to market conditions. Cattle have the longest life cycle of any livestock produced, since it takes 
cattle four years to reach maturity. Furthermore, the litter size for cattle averages only one calf per litter. 
Like all commodity markets, the price of cattle is dependent on supply and demand. In livestock 
markets, however, production cannot take place without a breeding stock. Producers must choose 
whether to send their stock to market or to keep animals on the farm to increase the future supply. 
Producers control the supply of breeding stock, and that stock is increased when producers withhold 
mature animals from market. Producers will increase their desired breeding stock (to increase future 
supply) when the expected markup ratio is high and will reduce their breeding stock when the markup is 
low. In the short term, the longer life cycle of beef reduces the producer’s ability to increase capacity to 
deal with rising demand. Industry experts believe it takes between three and four years for cattle 
producers to fully respond to price increases by increasing production capacity. 
Over the previous 100 years, the price of feed was relatively stable and predictable when adjusted for 
inflation. In the United States, almost 60 percent of corn production is used as animal feed. Recent 
trends in the corn industry have caused demand and prices for corn to increase. Ethanol production has 
increased the demand for corn, which in turn has increased feed costs for cattle producers. The price of 
corn affects beef and dairy production because added costs affect profit margins of beef producers. The 
ethanol industry's demand for feed grain will grow exponentially over the next five years increasing 
from less than 5 percent of supplies a few short years ago to perhaps 75—80 percent in 2009 if the 
United States produces ап 11-12 billion bushel crop. Typically any price changes in feed costs are 
reflected in beef prices within a week. 
Over the next few years, ethanol prices are expected to continually rise. The price of feed per bushel 
was $3.50 on January 1, 2006 and changed throughout the year. 
One consideration of the rdiet deua is the production of the Dry Distiller Grains (DDG) byproduct. 
This byproduct costs approximately 60 percent of what regular corn feed costs and can be fed to cattle. 
The availability of DDG as a source of feed is dependent ой the available supply. Ethanol producers did 
not start stockpiling DDG until the beginning of January and did not have enough supply to sell until 
approximately June 2006. 
Another potential effect of high corn prices is that ranchers could bring lighter cattle to market. 
Livestock weights have been 10 to 20 pounds lower than usual the past year. This will affect the 
quantity of beef on the market, wbich will mean less top-quality meat on the market, which will drive 
up beef costs. 


» 


Company Overview for 2006 


XYZ Company is a publicly held, regional fast food restaurant chain specializing in hamburgers. The 
Company has more than 600 locations and is located in ten different states in the southwestern United 
States. The corporate headquarters is located in Fort Worth, Texas. 


(continued on next page) 
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EXHIBIT 1 (continued) 


• XYZ Company is differentiated from its competitors by serving its hamburgers made-to-order. Each 
burger is cooked immediately after the customer orders, with a typical cooking time between 5-7 
minutes. Management believes this approach ensures a quality product and allo: ws the customers to see 
the difference in service and quality between XYZ Company and its competitors. 


* The company takes great pride in the low turnover of its employees. The corporate office does an annual 
employee survey to get feedback from its restaurant associates. Furthermore, past results have shown 
that those stores with the lowest turnover margin are the ones with the highest sales. Since 2002, the 
associate turnover rate has dropped by 92 percent. 


• Despite a history of profitability, management is concerned about the rising beef costs. Approximately, 
70% of the company’s sales are from beef products, which is substantially higher than most of their 
competitors that introduced chicken products years ago. Management believes these rising costs will 
reduce profitability, which will hinder their own ability to expand the menu by introducing a line of 
chicken products. 


models: (1) diagrams that visually display the causal interactions, (2) active learning techniques to 
ensure that participants focus on the necessary causal relations, and (3) decomposition of complex 
causal interactions. 

The systems-based participants began the experiment with a tutorial that discusses the nature 
and interaction of stocks and flows, provides examples, and asks participants to sketch how a stock 
balance would fluctuate over time. Packet A provides a stock-flow diagram, based entirely on the 
industry overview, showing how the major components of the beef industry interact (see Figure 3). 
Participants completed an active learning task by writing a sentence describing each interaction 
within the model. Finally, participants sketched the patterns of behavior for three stocks within the 
system that have a causal relationship (DDG byproduct, feed prices, and beef prices) using blank 
graphs similar to that shown in Figure 2. 

Participants in the reductionist-based condition used their own intuition to organize a mental 
model of the case materials. The reductionist participants received the exact same industry and 
company overview as the systems participants. While they did not have access to the stock/flow 
tutorial or diagram, I included steps to enhance their understanding of the case materials. First, to 
ensure that participants understood why constructing a mental model of industry evidence is 
important and a potentially complex task requiring the assimilation of numerous causal relations, 
I provided them with a tutorial on business risk auditing, drawing heavily on current auditing 
standards and methodologies (Bell et al. 1997; Bell et al. 2005; AICPA 2006). Second, I included 
an active learning task to ensure that the participants studied the industry overview thoroughly 
enough to construct a mental model of the client environment. Participants identified business 
risks from the industry overview and described how these risks affect the company. Finally, to 
reinforce the notion that environment components change over time, participants sketched out the 
behavior of beef prices over the year. 

Both perspective conditions are consistent with current professional standards and business 
risk auditing methods. The difference in each condition is in the level of aid provided to organize 
the multiple causal relations in the case. Note that the reductionist participants did not receive the 
stock-and-flow model, but the information in the industry overview described the facts needed to 
understand the interactions and nonlinear behavior within the industry assuming there were no 
working memory constraints. The reductionist participants received all the necessary information 
to understand the nonlinear movement of beef prices. Consequently, I expect them to understand 
any individual causal relations, but not be able to properly combine them due to working memory 
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FIGURE 2 
Computer Simulation of Beef Prices 





Jan Mar May July Sept Nov 


This figure shows a computer simulation of beef prices using the information from the industry overview 
(Packet A). This simulation model is a supply-and-demand market for commodities based on dynamic models 
provided in Sterman (2000). The model is coded to move to equilibrium in the absence of external forces as 
shown by the steady-state periods. The numbers above (1-6) represent features of the simulation used to code 
the students' graphs of beef price. A point was awarded if the graph exhibited evidence of the above features. 
The numbers represent the following: 


1. Тһе starting point for beef prices is $100 cwt. 

2. Beef prices should rise until approximately June. The rise is due to increasing feed costs as the prices 
of corn-based feed are rising due to the increased usage of corn for ethanol production. 

3. Decline in prices after June as beef producers begin to switch to the ethanol byproduct, DDG, which 
is 60 percent cheaper than regular corn-based feed. 

4. Prices should reach a law point in the periods following June. 

5. The market moves back to equilibrium following the decline in feed costs. This can be represented in 
an increase from the lowest point as long as it is not higher than the pre-DDG period. 

6. The final estimate should be lower than the highest point of the graph to reflect the decreased feed 
costs for beef producers. 


limitations. During the post-experiment questionnaire, I verified that participants in both condi- 
tions understood the basic causal relations related to beef prices. 

My second manipulated independent variable is the consistency level of management repre- 
sentations compared to entity-level evidence. The objective is to provide a representation that is 
judgmental and requires participants to simulate an outcome from their mental models to evaluate 
the representation. In the case, management explains that the company's deteriorated profitability 
stemmed from rising food costs due to increased beef prices. Management states that beef prices 
had risen to record highs over the past year, and provides a year-end beef price of either $125 per 
hundredweight of cattle (cwt) (consistent) or $165 cwt (inconsistent). Recall, that while partici- 
pants initially developed expectations, they had a framework (see Figure 2) with a lower boundary 
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of $20 cwt, a midpoint of $100 cwt, and an upper boundary of $180 cwt. I expect participants to 
use this graphical framework when simulating outcomes for representation evaluation. Therefore, 
the higher $165 (lower $125) price is less (more) likely to incorporate the DDG-driven decline in 
the second half of the year. As a result, it is more likely that the $165 ($125) representation is 
inconsistent (consistent) when considering the entity-level evidence. 


Control Variables 


Individuals have varying abilities to process and manipulate information in working memory. 
To control for these variations, participants completed the short-form Raven Advanced Progressive 
Matrices Test, measuring their ability to manage sets of information in working memory (Raven 
1976: Carpenter et al. 1990; Arthur and Day 1994). Additional control variables include self- 
assessments (using an 11-point scale) of mental effort on all packets, motivation on all packets, 
understanding of both the industry overview and the tutorial, and minutes spent on each packet. 


Dependent Measures 
Mental Model Development Measure 


The predictions rely on the systems manipulation helping participants to develop more coher- 
ently organized mental models. To measure the success of this manipulation, I calculate mental 
simulation coherence, which captures the causal sequence structure of a mental model. I examine 
the extent that participants’ mental simulation paragraphs have a logical causal chain throughout 
the paragraph. The paragraphs are coded based on the methodology used by Lesgold et al. (1988) 
to quantify experts’ problem-solving methods because this measure captures participants’ ability to 
perform a mental simulation that ties together components in a coherent progression without 
disconnections (Klein 1998). Participants list and then visually draw each attribute of the para- 
graph (і.е., each system component) on a graph. Exhibit 2 provides an example of the simulation 
paragraphs coding. The coherence metric is a percentage based on the number of system compo- 
nents connected by a single directional causal chain. This method ascribes a higher percentage to 
those simulations that tie together a larger number of components with a single directional causal 
chain. 


Hypotheses Dependent Measures 


To test H1, I evaluate the accuracy of participants’ expectations of beef prices. The partici- 
pants developed beef price expectations by sketching the time-series behavior of the prices for the 
period under review. I assign a point (maximum of 6 points) if the expectation sketches exhibited 
the same dynamic properties exhibited in a computer simulation (see Figure 2) using the facts and 
assumptions presented in the industry overview. Higher scores represent more consistency with 
the computer simulation,” 

If there are no significant differences between the industry-perspective conditions in regards 
to the amount of necessary information presented, then both conditions should have a basic 
understanding of the individual causal relations (i.e., the building blocks) driving beef prices. Any 
mental model improvements should be due to an inability to accurately organize these causal 


10 І coded the simulation paragraphs with the assistance of another graduate student, covering all information regarding to 
which experimental condition the participants’ packets were assigned. Both coders worked independently while coding, 
and mutually resolved any significant differences. The mean correlation index between raters for simulation coherence 
percentage is 0.85 (Kappa = 0.45, p < 0.001). 

11 A graduate student and I individually coded the graph scores and mutually resolved any significant differences. The 
mean correlation index between raters is 0.97 (Kappa = 0.81, p < 0.001). 
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EXHIBIT 2 
Example of Mental Simulation Scoring 


The items below represent an example of how the mental simulation paragraphs were coded. Each 
component within the simulation was assigned a letter. Then causal linkages were established based on 
how the participant described the relationship between components. ТЕе coherence measure was 
calculated based on the number of identified components in the system and the longest directional chain 
through the system. 


Number of Components: The total number of system components identified in the simulation (lettered 
items below). 


Longest Chain: The longest causal chain that exists throughout the simulation (numbered items below). 


1 1 y 
| Ф 
Ped = 


A. Increase in ethanol prices 

B. Increase in corn prices 

С. Increase in feed prices 

D. Rise in beef prices 

E. The company’s cost of production 

Е. The company's use of substitute products 
G. Decrease in beef consumption 

H. The company's profits 

Number of Components: 8 

Longest Chain: 6 

Simulation Coherence (Longest Chain/# of Components): 0.75 


relations, not from a lack of knowledge content of the causal relations. During the post- 
experimental questionnaire, participants answered multiple-choice questions concerning the rel- 
evant causal relations needed for developing accurate beef price expectations. 

To test H2, 1 employ a commonly used workload efficiency measure that establishes a zero- 
efficiency benchmark and bases performance on a certain effort level (Clark et al. 2006). Partici- 
pants provided a self-assessment rating of their mental effort level during the simulation paragraph 
task (using an 11-point scale). Such measures of mental effort are sensitive to variations in exerted 
effort (see Eggemeier and Wilson [1991] for review). I use simulation coherence as the perfor- 
mance measure in calculating processing efficiency because it represents the participants’ level of 
attainment during their mental simulations (Paas et al. 2003). 
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To determine processing efficiency, I first recalculate the mental effort rating and task perfor- 
mance measures as z-scores. Then, I calculate processing efficiency а5: 


ЕЕЕ= Р 
= р 


where EFF is the value of processing efficiency, Р is the performance z-score, апа К is the mental 
effort z-score. Both mental effort (x-axis) and performance (y-axis) are graphed on a Cartesian 
coordinate grid (see, e.g., Panel C of Table 3). EFF represents the perpendicular distance of a point 
on the axis P — R, which is the zero-efficiency line. In other words, on the line P — R, mental 
efficiency is zero (Е = 0) because performance is at the expected level given the amount of effort 
exerted. For example, higher mental effort results in higher performance, and vice-versa. If P -- R 
> 0, then EFF is positive, and if P — R < 0, then EFF is negative. Any point to the left of the 
zero-efficiency line (Е = 0) represents an increase in processing efficiency, and any point to the 
right represents a decrease in processing efficiency. Experimental conditions are evaluated based 
on their relative position on the coordinate grid. 

To test H3, 1 evaluate participants’ credibility assessments (using an 11-point scale) of man- 
agement representations regarding the company’s decline in profitability. In addition, after evalu- 
ating management representations, the participants provided a quantitative range (1.е., confidence 
interval) for their beliefs regarding year-end beef prices. This measure accomplishes two objec- 
tives. First, by analyzing whether the participants’ credibility assessments influence their ranges, I 
support H3 by showing that participants who recognize the inconsistencies incorporate this finding 
into their ranges rather than anchoring their assessment on management’s representation. Second, 
by showing that the participants’ initial ranges include their pre-developed expectations, I establish 
a benchmark to support my measures for expectation updating (H4), as described in the next 
section. і 

To test НА, I present participants with а paragraph including new information about the 


poultry industry. This new information is that poultry, unlike beef, cannot use the DDG byproduct _ - 


for feed. Correct assimilation of this information into an existing mental model shows that the 
demand for beef should rise because its substitute product, poultry, will not experience the DDG- 
related reduction in feed costs and subsequent price decline that beef will. The industry overview 
(Packet A) states that beef supply cannot increase in the short term. Quantitative ranges regarding 
beef prices (see H3 dependent measures) should shift upward at year-end, reflecting an increased 
demand for beef because of higher prices for chicken, a beef substitute. I verified this movement 
by adding these facts into a computer simulation. Participants provided another quantitative range 
(confidence interval) for beef prices after considering this new information. I expect systems 
participants with a coherent mental model to mentally simulate how the new evidence affects the 
existing model and increase their beef price estimates by shifting both ends of their quantitative 
range upward. I use an indicator variable to signify whether participants shift their ranges 
accordingly." 





12 This equation is based on the formula for calculating the distance from a particular point to the line ax + by + с 0 
in a coordinate system with coordinates (x, y) (Paas et al. 2003). 

3 Т do not examine the magnitude of the changes since there is no information in the case that suggests a specific dollar 
value change should occur. The dollar change is dependent on any assumptions made by the participant during the initial 
quantitative range and subsequent mental simulation during updating. Using dollar value of the confidence interval 
change as a dependent measure would slant the results in favor of those who made the largest dollar changes, whether 
correct or incorrect. -4r 


aA 
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IV. RESULTS 

Descriptive Statistics of Control Variables 

The systems participants took more time to complete both the tutorial (tss = 5.45, р = 0.001) 
and Packet A (155=2.87, р = 0.006) than did reductionist participants. This outcome is not 
surprising, considering that the tutorial presented system participants with complex material that is 
not a part of most educational programs. Consistent with the increase in time, the systems par- 
ticipants’ self-assessments of both effort level (ts = 2.97, p = 0.004) and motivation (tso = 2.36, р 
= 0.022) are higher than those of reductionist participants. This outcome is also intuitive because 
the systems-thinking material requires that participants learn less familiar and more complex 
concepts than the concepts learned by the reductionist participants. Both groups provide responses 
indicating that they understood the tutorial and overview, and there is no significant difference 
between conditions. There also are no significant differences for time and effort between industry 
perspective conditions for Packets B and C. 


Systems View Manipulation Check 


I expect that systems participants will generate mental simulations of entity-level evidence 
that are more coherently organized than those created by reductionist participants. As shown in 
Table 1, systems participants provide mental simulation paragraphs with a higher percentage of 
components held together by a single causal chain (F; 5; = 5.39, p = 0.012). This result suggests 
the systems intervention is effective in aiding how the systems participants organized the entity- 
level evidence in the case materials. "6 


Hypothesis 1 
H1 predicts that systems participants will develop more accurate expectations of beef prices 
than will reductionist participants. As shown in Table 2, Panel A the systems participants graph 
scores are significantly higher than those provided by the reductionist participants (ts; = 2.63, р = 
0.005). I collapse the individual expectations sketches for each condition in Panel C of Table 2, 
“which shows that systems participants better understand the nonlinear behavior of beef prices and 
anticipate the decrease in prices after June. A regression analysis (untabulated) reveals that a 
systems perspective significantly predicts lower beef price estimates at year-end (tss = —4.50, p < 
0.001). These results support H1. 


M The systems (reductionist) participants took 13.53 (8.23) and 20.21 (15.23) minutes to complete the tutorial and packet 
A, respectively. Using time as a control variable does not qualitatively change the reported results in this study. 
15 Further analysis of the simulation Gata (not tabulated) shows that self-assessed motivation increases the number of 
components listed (ts; = 2.28, р = 0.013), the number of causal links between components (ts = 2.03, р = 0.023), and 
the length of the longest causal chain (tsg = 1.80, р = 0.038). However, motivation does nor affect participants’ ability 
to tie all of the relevant components together into a coherent simulation (tg = 0.58, р = 0.283). This result supports the | 
notion that constraints in working memory capacity are the underlying cause of the systems participants’ improved 
simulations. 
Because coherence is a ratio, multiple sets of numbers could provide the same ratio even though they are tying together 
a different number of components (e.g., 3/4 and 6/8 will both equal 0.75). Since simulations with fewer components 
could be tied together with relatively little cognitive effort, I ran an additional analysis that excluded any observations 
with less than seven components (approximately the 25th percentile), When these items are excluded, the systems 
participants’ scores are still significantly higher (Е, де = 6.00, р = 0.009). 
Because effort level was significantly higher for the systems-based participants, the interaction between effort and 
industry analysis perspective is included in the model. I tested the graph scores for multicollinearity, because there is an 
expected level of dependency between the industry perspective condition and the self-assessments of effort level. I 
correct for the multicollinearity using the partial orthogonalization method. All remaining analyses with multicollinear- 
ity receive similar corrections. Due to the dependent nature of the variables, Y also performed a path analysis, finding 
& quantitatively similar results. 


The Accounting Review May 2011 
American Accounting Association 


How a Systems Perspective Improves Knowledge 933 





TABLE 1 
Manipulation Check: Mental Simulation Coherence 
Panel A: Mean (Standard Deviation) Simulation Coherence Scores 


Industry Analysis Perspective 
Dependent Measure Reductionist Systems 
Simulation Coherence Scores 0.60 0.70 
(0.18) (0.18) 
n = 30 п = 31 
Components used to calculate Simulation Coherence 
Number of Components 8.95 8.18 
(1.90) (2.30) 
Longest Chain 5.21 5.76 
(1.76) (2.31) 


Panel B: ANCOVA Analysis for Mental Simulation Coherence Scores 


Variable Еу] p-value 
Industry Analysis Perspective 5.39 0.012 
Effort (Packet B) 0.07 0.396 
Raven score 0.02 0.440 


Indicator (0/1) coding was used for Industry Analysis Perspective in this table (systems-based perspective = 1; 
reductionist-based = 0). 
All p-values are one-tailed. See Exhibit 2 for coding example. 
Variable Definitions: 
Simulation Coherence Scores = (Longest Chain/Number of Components); 
Number of Components = number of individual components identified in the simulation paragraph; and 
Longest Chain = longest causal chain that exists throughout the simulation paragraph. 


To determine if both conditions understand the basic knowledge content necessary to estimate 
beef prices, during the post-experiment questionnaire, participants answered two multiple-choice 
questions essential to recognizing the nonlinear behavior of beef prices. The questions concern 
how the DDG byproduct affects feed prices (first question), and how feed prices affect beef prices 
(second question). Approximately 71 percent of the systems participants correctly answer the first 
question compared to approximately 76 percent of the reductionist participants (Іза = - 0.42, р > 
0.500). Approximately 97 percent of both conditions answer the second question correctly (tsg 
= 0.05, p > 0.500). These results imply that, when prompted, the reductionist participants possess 
knowledge content of the general causal relations driving beef prices but are unable to incorporate 
this knowledge properly into their expectations. Such a result is consistent with existing research, 
finding that decision-makers typically can identify and understand individual cues but typically 
cannot combine them properly when working memory is overwhelmed (Sterman 1989; Diehl and 
Sterman 1995; Einhorn 2000). Furthermore, this result lends support for my prediction that the 
systems view improvements are from knowledge organization differences, not entirely from dif- 
ferences in essential knowledge content between the conditions. 


Hypothesis 2 
In H2, I predict that participants with a systems perspective will exhibit more efficient work- 
ing memory processing when performing a mental simulation than will reductionist participants. 
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TABLE 2 
H1: Graph Scores for Beef Price Expectations 


Panel A: Mean Graph Scores (Standard Deviation) of Beef Price Graph Scores 
Industry Analysis Guidance 





Dependent Measure Reductionist Systems 
Graph Score (max of six) 2.35 3.25 
(1.02) (1.32) 
n = 30 n = 31 
Panel B: Regression Results for Graph Scores 
Variable B SE t p-value 
Constant 1.23 0.88 1.39 0.085 
Industry Analysis Perspective | 0.87 0.33 2.63 0.005 
Effort (Packet A) 0.08 0.12 0.64 0.261 
Effort * Industry Analysis Perspective -0.08 0.24 —0.34 0.368 
Raven score 0.08 0.06 1.20 0.117 


Panel C: Graphical Representation of Expectations" 





Indicator (0/1) coding was used for Industry Analysis Perspective in this table (systems-based perspective = 1; 

reductionist-based = 0). 

All p-values are one-tailed. 

" The graph was calculated by collapsing the participant's beef price graphs for each condition. The resulting average 
values were input into a systems software program to show the discrepancies between the expectations for each 
condition. 


Variable Definitions: 
Beef Price Graph Scores = how many points the participants’ beef price expectations received when compared to a 
computer simulation of beef prices. See Figure 2 for the legend of the computer simulation 
of beef prices. A point was given if a participant’s expectation graph exhibited the properties 
shown in Figure 2; 
Effort = self-assessed mental effort level exerted in Packet A using ап 11-ро scale; and 
Raven = total number of correct questions on the Raven’s Progressive Matrices short-form test. 


The Accounting Review May 2011 
American Accounting Association 


How a Systems Perspective Improves Knowledge 935 


The participants’ self-assessed understanding of both the client overview and the tutorial serve as 
control variables because an understanding of these items is essential for constructing an accurate 
simulation. Furthermore, because the Raven score controls for innate working memory processing 
ability, including this score in my analysis highlights any processing improvements stemming 
from mental model organization. The results in Table 3, Panel B show that the systems participants 
have more efficient processing when constructing a coherent mental simulation (Е, 55 = 4.60, р = 
0.018). Panel C provides a graphical presentation and shows that the systems-based efficiency 
score is above the zero-efficiency benchmark (Е = 0) line, whereas the reductionist-based score is 
below the line. These results support H2 and are consistent with prior studies showing that 
increased use of causal schemas reduces working memory constraints and improves processing 
efficiency (Paas et al. 2003; Clark et al. 2006). 


Hypothesis 3 
H3 predicts that systems participants receiving inconsistent representations evaluate those 

representations as less credible than participants in the other three conditions. Table 4, Panel A 
provides descriptive statistics and shows a divergence of credibility assessments between the two 
industry analysis conditions. Panel B presents the conventional ANCOVA results controlling for 
effort level and Raven score. Because H3 predicts an ordinal interaction, contrast coding is the 
most appropriate method of testing the hypothesis (Buckless and Ravenscroft 1990). Contrast 
weights are +3 in the systems/inconsistent condition and —1 in the other three conditions. Panel 
C shows the planned contrast is significant since systems participants receiving inconsistent man- 
agement representations provide the lowest credibility ratings (Fi 53 = 7.12, р = 0.005). Follow-up 
tests of the simple effects show that systems auditors are able to distinguish between inconsistent 
and consistent representations (Е; 5з = 4.69, р = 0.017), and are more likely than reductionist 
auditors to recognize inconsistent representations (Е, 53 = 6.62, р = 0.001). Reductionist partici- 
pants do not distinguish between evidence consistency, suggesting that they have difficulty simu- 
lating how the explanation compares to their existing knowledge. These results support H3. 

‚+ НЗ implies that systems participants will fixate their judgments less on management repre- 
sentations, and instead rely more by their own pre-developed expectations. I perform a regression 
analysis (untabulated) examining how the participants’ credibility assessments affect their beef- 
price quantitative ranges. When the materials present inconsistent representations, the systems 
participants’ credibility assessments are a significant predictor of the upper bound (tag = 2.26, р = 
0.014). This result suggests that those participants recognizing the inconsistency between their 
pre-developed expectations and management representations are less likely to anchor their infer- 
ences about beef prices on management representations (i.e., they are more likely to shift their 
confidence intervals downward in relation to the inconsistent management representation). By 
contrast, the reductionist participants' credibility assessments do not have a consistently significant 
effect on the parameters of the range. These results further support H3. 


Hypothesis 4 


НА predicts that participants’ most accurate updating of their mental models upon receiving 
new entity-level evidence will occur when they have a coherently organized mental model and 
when they take a systems perspective of the industry. Table 5, Panel A shows that the interaction 
of simulation coherence and systems-based condition significantly predicts whether participants 
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TABLE 3 
H2: Processing Efficiency Results 
Panel A: Mean (Standard Deviation) Processing Efficlency Scores" 





Industry Analysis Guidance 

Dependent Measure Reductionist Systems 
Processing Efficiency Score —0.17 0.17 
(0.95) (1.05) 

n = 30 n = 31 

Components of Processing Efficiency Score 

Performance Z-score -0.29 0.29 
Mental Effort Z-score -0.50 0.50 


Panel В: ANCOVA Analysis for Mental Simulation Coherence 


Variable Fy 55 | p-value 
Industry Analysis Perspective 4.60 0.018 
Understanding—-ST Tutorial 143 0.119 
Understanding— BR Tutorial 1.15 0.144 
Understanding— Overview 0.28 0.298 
Raven 0.11 0.373 





Panel C: Graphical Representation of Results 





Indicator (0/1) coding was used for Industry Analysis Perspective in this table (systems-based perspective = 1; 

reductionist-based = 0). 

All p-values are one-tailed. 

* The Processing Efficiency Score is the relative processing efficiency based on both performance and effort. In this 
calculation, the simulation coherence score (see Table 2) was used for the performance measures. Effort was the amount 
of self-assessed mental effort level exerted in Packet B. 

Variable Definitions: 

Processing Efficiency Score = [(Performance Z-score — Effort Z-score)/Square Root of 2]; 

Performance Z-score = calculated Z-score based on the simulation coherence score; 
Mental Effort Z-score — calculated Z-score based on the self assessment of effort in Packet B; 

Understanding—ST Tutorial = self-assessed understanding of the stock and flow tutorial given in the systems thinking 

condition using an 11-point scale; 


(continued on next page) 
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Understanding—BR Tutorial = self-assessed understanding of the business risk tutorial given in the reductionist 
condition using an 11-point scale; and 
Understanding—Overview = self-assessed understanding of the industry and company overview given to all participants 
using an 11-point scale. 





increase both ends of the range (Wald x? 92 = 3-12, p = 0.039).18 І verify the robustness of these 
results by finding that the processing efficiency measure from H2 (see Panel B of Table 5) yields 
consistent results (Wald д2 527 4.36, р = 0.019). These results support на. 

In sum, the results of this study show that а systems view improves information-processing 
ability and therefore the likelihood of acquiring and updating both knowledge content and struc- 
ture. Most accounting studies assume that the effect of processing ability on knowledge acquisi- 
tion is relatively static and does not change over the short-run (Libby 1995; Bonner 2007), but 
studies outside of accounting call for research that considers the implications of more dynamic 
associations between knowledge and ability (Ceci et al. 2003). My results suggest that a more 
dynamic relationship exists in the short-term between one component of knowledge, structure, and 
one component of information-processing ability, working memory capacity. This reinforcing 
relationship between knowledge organization and processing ability is significant because ignoring 
it fails to account for substantial variation in task performance (e.g., understanding and modeling 
complex business environments). 


У, CONCLUDING REMARKS 

This study examines whether a systems-based perspective of a client economic environment 
improves novice auditors’ knowledge acquisition and subsequent analytical procedures perfor- 
mance. Auditing theorists suggest that training auditors to use systems-thinking skills can improve 
how they learn and use complex entity-level evidence that would normally overwhelm auditors 
(Bell et al. 1997; Bell et al. 2002). There is little empirical evidence showing the effectiveness of 
system approaches, and researchers and practitioners have struggled to find a successful and 
efficient way to implement systems skills due to the extensive training that is typically necessary. 

І conduct a laboratory experiment in which novice participants learn about complex entity- 
level evidence for use in analytical procedures. I find that participants taking a systems view of the 
client environment develop more accurate and coherent mental models than participants using a 
reductionist perspective. In turn, systems participants are more likely to identify management 





Ї# In Panel A, the Raven score significantly reduces the chances of increasing the confidence interval (Wald y7 = 5.97, р 
= 0,007), suggesting that participants with higher innate processing ability are more likely to incorrectly reduce the 
confidence interval. A further examination (untabulated) finds that the Raven score is significant for the reductionist- 
based condition (Wald x? „= 3.58, р = 0.029) but not for the systems-based condition (Wald x? 527 1.22; p = 0.135). 
To lend some insight into this result (untabulated), I find that the Raven score increases reductionist participants’ 
self-assessed confidence level in evaluating beef prices (tss = 1.46, р = 0.075) but decreases the systems participants’ 
confidence level {tss = - 1.44, p = 0.078). This distinction implies that those with higher “fluid intelligence” levels, but 
with an impoverished mental model, are more likely to be overconfident in their ability to understand and utilize 
industry evidence. Conversely, those with better-organized knowledge structures recognize the complexity of the prob- 
lem and are not overconfident in their ability to interpret it. This result supports the findings of Choy and King (2005), 
who suggest that decision-makers over-estimate their own innate abilities when solving dynamic problems. 

І also examine other attributes of the updated beef price confidence intervals as dependent measures. The interaction of 
simulation coherence and systems thinking is either significant or marginally significant when using a separate indicator 
variable for an increase in the lower bound (Wald X? 52 = 3.44, р = 0.032) and the upper bound (Wald X? ») = 2.43, р = 
0.059) of the confidence interval. With these two measures, neither simulation coherence nor systems thinking are, in 
isolation, significant. Finally, I examine whether there was a tendency to incorrectly reduce the confidence intervals, 
finding no significant results for any condition. 
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TABLE 4 
H3: Credibility Assessment of Management Representation 


Panel A: Mean (Standard Deviation) of Credibility Assessments 








Industry Analysis Perspective 
| Reductionist Systems Totals 
Consistent—$125 7.07 7.57 7.32 
(1.33) (2.06) (172) 
n= 14 n= 14 n = 28 
Inconsistent—$165 7.25 6.17 6.73 
(1.49) (1.53) (1.67) 
n= 16 n= 15 n= 31 
Totals 7.16 6.84 7.01 
(1.49) (1.91) (1.70) 
n = 30 п = 29 п = 59 
Panel В: Conventional ANCOVA Analysis 
Sum of Mean 
Variable Squares af Square Е p-value 
Industry Analysis Perspective (ТАР) 2.307 1 2.307 0.89 0.175 
Representation consistency (RC) 6.627 1 6.627 2.54 0.058 . 
ЈАР * RC 11.203 1 11.203 4.30 0.022 
Effort (Packet C) 13.959 1 13.959 5.36 0.012 
Raven score 0.000 1 0.006 0.00 0.497 
Error 138.031 53 2.604 
Panel C: Contrast Analysis and Follow-Up Simple Effects Test 
Contrast DF Е p-value 
Summary Hypothesis: Combination of inconsistency and 1,53 7.12 0.005 
systems-based perspective difference from other three 
conditions. 
Follow-Up Simple Effect Tests 
Simple effect of Industry Analysis Perspective given 1,53 6.62 0.001 
inconsistent management explanation 
Simple effect of consistent explanation versus inconsistent 1,53 4.69 0.017 


explanation given systems-based perspective 


Indicator (0/1) coding was used for Industry Analysis Perspective in this table (systems-based = 1; reductionist-based = 
0). Indicator (0/1) coding was also used for Representation Consistency (inconsistent = 1; consistent = 0). 
All p-values are one-tailed equivalent. 
Variable Definitions: 
Effort = the self-assessed mental effort level exerted in Packet C using an 11~paint scale; 
Raven = the total number of correct questions on the Raven’s Progressive Matrices short form test. 
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representations that are inconsistent with their expectations of entity-level evidence. In addition, 
the better-organized mental models of the systems participants provide more efficient use of 
working memory capacity, which aids the accurate updating of their expectations when in the 
presence of new industry-level evidence. These results are consistent with educational psychology 
findings that individuals can learn more effectively when they construct the incoming evidence as 
causal schematic structures (Clark et al. 2006). This study provides insight into how knowledge 
structure and processing ability interact to enhance learning, shortening the path to building 
auditors’ expert-consistent performance when performing complex audit tasks (Nelson and Tan 
2005). 

Professional standards require auditors to understand complex entity-level evidence (AICPA 
2006). Therefore, this study highlights a potential disconnect between what standards now require 
of auditors and what they are cognitively capable of doing. This disconnect suggests that new 
systems-based training methods could help auditors to learn better in the field and develop the skill 
sets needed to understand complex environments (PwC 2003; Peecher et al. 2007). Leaders in 
systems-thinking suggest that quantitative model building is the most effective way to fully learn 
and understand dynamically complex systems (Hannon and Ruth 1997; Sterman 2000). Future 
research could examine if new tools and decision aids, such as dynamic modeling, can help 
auditors accelerate learning about complex client environments. 


Limitations 

A potential concern with this study is whether my systems-based manipulation could be cost 
prohibitive in an actual audit because it would require the provision of pre-developed diagrams for 
each client. Recall that the purpose of this study is to examine whether and how systems tech- 
niques can improve audit quality, not to provide a specific methodology for firms. The systems 
manipulation is simply a tool to examine the improvements of a systems-consistent mental model. 
Therefore, while the diagram utilized in this study was necessary to help participants develop a 
systems view, I do not expect such a pre-developed diagram to be necessary in an actual audit. 
Systems advocates suggest the type of information provided in the systems manipulation could be 
either cognitively or formally built by someone properly trained in such skills (Hannon and Ruth 
1997; Sterman 2000; Bell et al. 2002). Whether auditors can effectively and efficiently actually 
build these systems-consistent mental models remains a question for future research. 
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ABSTRACT: Dichev and Tang (2008) document a dramatic decrease over the last 40 
years in the contemporaneous correlation between revenue and expense, along with 
an associated Increase in earnings volatility and a decline In earnings ‘persistence, 
suggesting a decline іп earnings quallty. We document that these changes are primarily 
attributable to an increase in the incldence of large special items. We then examine the 
extent to which this increase In special items Is due to either more frequent real eco- 
nomic events related to special item recognition or to the adoption of new accounting 
standards. Our evidence suggests that changes in the frequency of economic events 
associated with special items have played a more important and sustained role relative : 
to the role played by adoption of Individual accounting standards. Finally, we find that 
the changing incidence of these economic events is at least in part related to the 
well-documented increase In competition in the U.S. economy over the last four de- 
cades. | 


Keywords: earnings persistence; eamings volatility; matching; special tems. 
Data Avallability: Data are available from public sources as identified in the text. 
JEL Classifications: M47. 


| І. INTRODUCTION 
ichev and Tang (2008; hereafter, DT) provide important evidence on changes over time in 
the relation between contemporaneous revenue and expense. For the 1,000 largest U.S. 
firms from’ 1967 to 2003, they find a decrease in the correlation between current-period 
revenue and current-period expense and ап increase in the correlation between current-period 
revenue and expense from the prior and subsequent periods. These findings suggest a decline in 


- "matching," such that an increasing amount of expense is being recognized before and after the 


: period in which it affects revenue. DT suggest these changes could arise from either changes in 
‚ economic factors or changes in accounting standards, and provide indirect evidence that changes 
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in accounting standards are primarily responsible. In this study, we build upon DT by first iden- 
tifying which expense line items are responsible for changes in the revenue-expense relation. We 
then examine whether the responsible line items have been more affected by changes in certain 
economic activity or by changes in specific accounting standards. 

Understanding the factors leading to changes in the revenue-expense relation is important for 
researchers, financial statement users, and accounting standard-setters for two reasons. First, these 
changes have important implications for earnings quality. As modeled by DT and supported by 
their empirical results, reporting related revenue and expense in different accounting periods adds 
noise to the measurement of earnings. This noise increases earnings volatility and decreases 
earnings persistence, consistent with a decline in earnings quality (see, e.g., Dechow 1994; Francis 
et al. 2004; Dechow and Schrand 2004; DT). Thus, the results in DT suggest a prolonged decline 
in the quality of earnings. 

Second, identifying the source of these changes in earnings properties is important because 
the two explanations we investigate have different policy implications. If accounting standards are 
responsible for changes in the revenue-expense relation, then the resulting decline in the “useful- 
ness" of accounting information is in direct opposition to the objectives of the Financial Account- 
ing Standards Board's (FASB) conceptual framework. If this is the case, it would be important to 
understand which standards are most responsible and consider corrective action. Alternatively, if 
changes in economic activity are responsible, then standard-setters should consider this possibility 
when they evaluate current and future standards. 

То determine which expense items are responsible for changes in the revenue-expense rela- 
tion over time, we disaggregate total expenses into: cost of goods sold; selling, general, and 
administrative expense; depreciation; taxes; other income/expenses; and special items. Using a 
simple decomposition framework, we show that the decline in the relation between revenue and 
current expense is attributable primarily to a single income statement line item—special items. 
This line item consists mostly of asset impairments, restructuring charges, and gains/losses from 
asset sales. Not surprisingly, current-period special items have a much lower level of association 
with current-period revenue than other expenses. In addition, the “weight” of special items as a 
component of total expense has increased with the incidence of special items over time, decreasing. 
the relation between current revenue and total current expense. We find that the most important 
changes in revenue and expense patterns are substantially reduced (by 90 percent in many cases) 
by either (1) excluding firm-years that report large special items (those greater than 1 percent of 
total assets), (2) using operating expense after depreciation (which excludes special items) to 
define total expenses, or (3) adding back special items to net income. 

Next, we examine whether the increasing incidence of special items is due to changes in 
economic activity or changes in specific accounting standards. First, we provide evidence on 
whether the increasing incidence of special items is related to changes in the frequency of specific 
economic events associated with recognition of special items. We identify such economic events 
using five variables: negative employee growth; merger and acquisition activity; discontinued 
operations; negative revenue growth; and operating losses. These five variables represent real 
economic events whose frequencies are not likely to be influenced by accounting standards or by 
bookkeeping practices. Rather, they likely capture economic circumstances directly or indirectly 
associated with asset impairments, restructuring activities, and asset sales. We combine these 
variables into an index and show that this index both (1) increases throughout our study period, 
and (2) explains significant cross-sectional variation in the incidence of special items at the firm 
level. This finding provides strong evidence that the increase in special items is related to changes 
in the incidence of underlying economic events. 

Next, we search for accounting standards adopted during our study period that are related to 
specific types of special items and find relatively few standards of this type. Among those that we 
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find, many appear to be either (1) reactive to changes in economic activity and accounting practice 
rather than leading such changes, or (2) too late in the study period to explain increases in special 
items prior to their adoption. In addition, we find that total special items increased in periods when 
no new accounting standards were adopted. Moreover, targeted tests of specific special items 
provide little evidence that adoption of specific accounting standards affected the frequency of 
these special items. Only for asset impairments do we find that implementation of a standard was 
associated with an increase in special items after controlling for a general time trend and our index 
of economic activity. 

We also compare the effects of the specific accounting standards and our index of economic 
events on the increase in total special items. We find that changes in economic events play a more 
important role in the increasing incidence of special items over time relative to the role played by 
the adoption of specific accounting standards. 

Finally, due to the apparently important role of economic events, we examine whether in- 
creases in our index of economic events are related to increasing competitive pressures in the 
economy documented by prior studies. We identify two measures of competitive pressure: industry 
concentration ratio and average probability of bankruptcy for the industry. We combine these 
measures into an index of competitive pressure and find that this index is (1) increasing over time 
throughout our study period, and (2) positively associated in the cross-section with our index of 
economic events related to special item recognition. 

This study makes several important contributions. First, the results indicate that the deterio- 
ration in the contemporaneous relation between revenue and expense is due primarily to a dra- 
matic increase in the frequency of reported special items, not to systematic issues across multiple 
line items in the income statement. Second, we provide strong evidence that an increase in the 
frequency of economic events associated with special item recognition plays an important role in 
the increasing frequency of special items. Third, the results also indicate that changes in specific 
accounting standards related to special items play, at best, a secondary role in the increasing 
frequency of special items. Thus, a primary implication of our results for standard-setters is that 
they should be aware of the relation between changes in the economic environment and the 
increasing incidence of special items over time. 


Ц. BENCHMARK CHANGES IN THE REVENUE-EXPENSE RELATION 

In this section, we provide benchmark evidence consistent with DT that the correlation be- 
tween current-period revenue and current-period expense has declined over time. We begin by 
selecting a sample that is generally consistent with the sample in DT. First, we identify the 1,000 
largest firms each year from 1967 to 2005 on the basis of total assets. Like DT, we select relatively 
large companies to confine our focus to changes in earnings properties among economically 
significant firms. This selection also helps maintain comparability throughout the sample period, 
as results are unaffected by the proliferation of smaller public firms later in the sample period. We 
require firms to have data available to calculate average earnings over successive, ПОП- 
overlapping, two-year intervals ("two-year earnings") and the volatility of two-year earnings over 
the preceding ten years. We also require firms to have non-missing values for cost of goods sold, 
SG&A expense, income tax expense, and operating income after depreciation. We exclude finan- 
cial firms, due to the difficulty in interpreting conventional income statement components for these 
firms.! Finally, we eliminate observations in the extreme 1 percent of the distribution each year of 


1 When we include financial firms in our sample, the results for the main analyses below are inferentially identical. 
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two-year earnings, two-year revenue and expense, two-year earnings volatility, current revenue, 
current expense, last-year expense, and next-year expense.” The sample has 32,645 firm-year 
observations between 1967 and 2005. 

Next, we estimate the following cross-sectional regression by year, which is identical to the 
regression reported in DT (2008, Table 3): 


Rev, = Yo + уУ,Ехр_ + Узбриу + узЕхр| + бү. (1) 


Rev,, is total revenue for firm i for year t, and Exp,, is total expense, computed as revenue minus 
net income before extraordinary items for firm i in year t, divided by average total assets for year 
t. These, and all other variables used in the study, are defined in the Appendix. The coefficient on 
current-period expense estimates the extent to which current-period expense varies with revenue in 
the current period, and the coefficient estimates on expense in the prior period and the next period 
estimate the extent to which expenses are scattered to adjacent periods. 

Average coefficients are reported in Table 1, Panel A, for the first half (1967 to 1985) and 
second half (1986 to 2005) of the study period. The average coefiicient estimate on contempora- 


TABLE 1 
Properties of Earnings-Related Variables over Time 
Panel A: Relation of Revenue to Lagged, Current and Future Expense 


Ехр!-1 Exp, EX Prsi 
1967-1985 0.002 1.032 —0.030 
1986-2005 0.089 0.895 0.025 
Difference нэ 0.087*** —0.137*** 0.055*** 


t-statistic (5.98) (—5.54) (4.04) 


Panel B: Volatility and Persistence of Earnings 


Volatility Persistence 
1967-1985 0.022 0.805 
1986-2005 0.036 0.680 
Difference 0.014*** —0.125*** 
t-statistic (9.77) (-6.32) 


Ж ЖЖ ЖЖЖ Represent significance at the 10 percent, 5 percent, and 1 percent levels, respectively, in two-tailed tests. 
For difference calculations only, t-statistics are in parentheses below the coefficient estimates. 


This table presents properties of earnings-related variables between two time periods, 1967-1985 and 1986-2005. Panel A 
presents average annual coefficients from estimating the following model each in both time periods: 


Model 1: Кеш, = Yo + Y Exp, + HED + узЁхр t Е. 


Panel B presents average firm specific volatility of Earn and average coefficients (persistence) from estimating the 
following model each year in both time periods: 


Model 2: Eam, = y+ у,Еат ‚+ £j. 
Variables are defined in the Appendix. 





2 These eliminations follow the sample selection procedures in DT. When we do not eliminate these observations, the 
decrease in the contemporaneous correlation between revenue and expense is somewhat more pronounced. 
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neous expense is 1.032 (0.895) in the early (late) period, and the difference of 0.137 is highly 
significant (p < 0.01). We also report increases in the lag (lead) coefficient from 0.002 (—0.030) 
in the early period to 0.089 (0.025) in the late period. 

As described by DT, the decline in the contemporaneous association af revenue and expense 
is likely to have implications for the volatility and persistence of earnings, which in turn have 
important implications for the quality of earnings. Specifically, scattering expenses related to 
current revenue to other time periods adds “noise” to reported expenses, resulting in more volatile 
and less persistent earnings. To see this, we follow DT and compute the firm-specific volatility of 
earnings (as measured by its standard deviation) over rolling five-year windows, and then average 
this measure across firms each year and then across years within each sub-period. We also follow 
DT and estimate earnings persistence using the following regression: 


Earn; = Yo + y,Earn,, + ё. (2) 


The results for both earnings volatility and earnings persistence are presented in Panel B of 
Table 13 The volatility of earnings increases from 0.022 in the early period to 0.036 in the later 
period, and the change is significant (p < 0.01). The earnings persistence coefficient (y, from 
model 2) decreases from 0.805 in the early period to 0.680 in the later period, and this difference 
is also significant (p < 0.01). These results indicate that the decline in the contemporaneous 
relation between revenue and expense documented in Panel A of Table 1 has produced an increase 
in the volatility of earnings and a decrease in the persistence of earnings from the early years to the 
later years in our study period, consistent with a decline in earnings quality (see, e.g., Dechow 
1994; Francis et al. 2004; Dechow and Schrand 2004). | 

Overall, the results reported in Table 1 are virtually identical to those reported by DT (2008, 
Tables 3-5). In the next section we examine which income statement line items are responsible for 
the decline in the revenue-expense relation documented in Table 1. 


Ш. DISAGGREGATED EXPENSES AND THE REVENUE-EXPENSE RELATION 

We begin our analysis with a decomposition of the change in average coefficients from 
Equation (1) between the early period and the late period. To do this, we divide total expense in 
Equation (1) into six components: cost of goods sold (COGS), selling, general, and administrative 
expense (SGA), depreciation expense (DEPR), tax expense (ТАХ), other expenses (OTH), and 
special items (51). SGA includes R&D and advertising expense. Estimating Equation (1) implicitly 
constrains all of the coefficients on these six components that comprise total expense for a given 
year to be identical. Alternatively, the following regression allows the coefficients on these six 
components for the current year to vary freely: 


Кери Yor NEP it-1 + Ух COGS, Куб СА, + уу DEPR;, + угдГАХ,, + 9, 0TH;, 
+ уза + УЗЕхр н + Ent (3) 


A decomposition similar to that in Kee (2009) demonstrates that the annual coefficient esti- 
mate for Exp, (%) in Equation (1) is a weighted average of the parameters оп the individual 
component estimates (75, through yzf) from Equation (3).* Thus, the decline we observe in the 
coefficient estimate on aggregate expense may occur for one or both of two main reasons. First, it 


3 To mitigate the influence of extreme observations in our volatility and persistence tests, we eliminate observations in the 
extreme 1 percent of the distribution each year of earnings volatility and current- and prior-year earnings. 

4 The decomposition in Kee (2009) is limited to two components that are assumed to be positively correlated. Our 
analysis is based on six components whcse pair-wise correlations may be either positive or negative. A proof that the 
analysis in Kee (2009) extends to our case without loss of generality is available from the authors on request. 
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is possible that the coefficient estimates for some (or all) of the individual expense components 
decline between the two periods. Second, it is also possible that the “weights” of the individual 
components have shifted over time, away from components with larger coefficient estimates and 
toward components with smaller coefficient estimates. As demonstrated by Kee (2009), the 
weights are related to the importance of a particular component—specifically, how much of the 
variance of total expense is attributable to each expense component—and must sum to 1 by 
construction." 

Table 2 summarizes the changing coefficients and weights of the expense components. The 
first set of rows provides the average coefficient estimates for the expense components for the 
early and late periods as well as the change between periods. The next set of rows provides the 
average weight for each component for each period as well as the change between periods. It is 
clear from Table 2 that special items has the lowest average coefficient estimate in both periods 
and that its average coefficient estimate has declined the most from the early period (0.483) to the 
late period (0.197). At the same time, we observe a dramatic shift in weight toward special items 
(from 0.009 to 0.129) and away from high-coefficient components like COGS and TAX. The 
increased weight on special items is consistent with prior studies that document an increase over 
time in one-time, special charges such as asset write-offs, asset sales, and restructuring charges 
(Elliott and Hanna 1996; Collins et al. 1997; Fairfield et al. 2009). 

















TABLE 2 
Decomposition Analysis 
Current Year Expense 
COGS SGA DEPR TAX 51 ОТН 

Coefficients 1967-1985 1.004 1.017 1.058 1.891 0.483 0.584 
1986-2005 0.996 1.030 0.853 2.123 0.197 0.480 
Difference -0.009 0.014 -0.205 0.233 —0.286 —0.105 
Weights 1967-1985 0.834 0.106 0.006 0.039 0.009 0.007 
1986—2005 0.711 0.122 0.013 0.011 0.129 0.013 
Difference —0.122 0.016 0.008 —0.027 0.120 0.006 


We decompose the aggregate coefficient estimates on current expense from Panel A of Table 1 into six income statement 
components and their respective weights, using the following model: 


Model 3: Revy = + УЁхр + 794СОС5,, + љь5СА,, + ъ.РЕРК,, + уа TAX, ye OTH,, + у, WEP 


te 


Kee (2009) demonstrates that the coefficient on current expense in Panel A of Table 1 (44) is simply a weighted average 
of coefficients on the expense components (55, through уу) in (3). The weight for any component, equals Cov(Exp, 
component)/Var(Exp), where Exp is total current expense (Kee 2009). 


5 The weight for any component, equals Cov(Exp , component) / Var(Exp), where Exp is total current expense (see Kee 
[2009] for a proof). This is the coefficient one obtains from an OLS regression of component, on total expense. Since 
Var( Exp) = £, Cov(Exp , component), the weights must sum to 1. 

$ A recent study by Cready et al. (2010) documents that the persistence of special items varies positively with the 
frequency of reported special items in the prior three years, indicating that for some firms these nonrecurring items are 
more recurring than for other firms. However, their evidence indicates that even for firms with five or more special items 
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These results are consistent with the decline we observe for the coefficient estimate on 
contemporaneous expense in Equation (1)—the low (and decreasing) coefficient estimate on spe- 
cial items increases in importance (weight) and drags down the coefficient on total expense. In 
contrast, the results for the other expense components are not consistent with the decline in the 
coefficient estimate on contemporaneous expense in Equation (1). For the other expense compo- 
nents, either the change in the estimated coefficient from the early period to the late period is very 
small or increasing (COGS and SGA) or the weight of the component is low in both periods 
(DEPR, ТАХ, and OTH)? 

We perform four analyses to corroborate the critical role of special items in changes in the 
revenue-expense relation and the volatility and persistence of earnings reported in Table 1. First, 
we repeat all analyses in Table 1 after deleting observations with a large special item that would 
affect any of the results, where “large” is defined as 1 percent of average total assets for the year? 
The results, which are reported in Table 3, Panel A, demonstrate the substantial effect large special 
items have on the earnings properties. After excluding observations with large special items, the 
coefficient estimate for current-period expense from Equation (1) is not significantly different 
between the early and late periods (p — 0.78), as compared to a significant decline of 0.137 in 
Table 1 (p « 0.01). In addition, between the early and late periods volatility (persistence) increases 
(decreases) by only 0.002. (0.003), as compared to 0.014 (0.125) in Table 1. Although the increase 
in volatility remains significant at the 0.01 level after excluding observations with large special 
' items, the magnitude is only 14 percent of the magnitude reported in Table 1. 

As a second method of evaluating the effect of large special items, we sort observations in the 
later period into those with and without large special items. There are not enough observations 
with large special items during the first years of the early period to conduct this analysis in the 
early period. The results, which are reported in Table 3, Panel B, demonstrate the dramatic effect 
of special items. For observations without special items, the late period coefficient estimate for 
contemporaneous expense from Equation (1) is 1.046, which is similar to the early period coef- 
ficient estimate reported in Table 1 (1.032). In contrast, the late period contemporaneous expense 
coefficient for observations with large special items is 0.752, which is significantly less than the 
estimate for observations without large special items (t — 8.44; p « 0.01). In addition, observa- 
tions with large special items have significantly larger (smaller) estimates for volatility (persis- 
tence), demonstrating the effect of special items on these earnings properties. 


in the previous twelve quarters, the persistence of those special items is still substantially below that of earnings adjusted 

for special items (see Cready et al. 2010, Table 3). Thus, while the recurring nature of some special items might partially 

mitigate the effects we report in Table 2, such mitigation is likely to be slight. 
7 Although the weight on COGS declines substantially from the early to the late period in Table 2, we view the decline 
in the revenue-expense relation as a function more of special items rather than COGS. The weights must sum to 1, such 
that a dramatic increase in relative importance of special items must mechanically be offset by a decline in relative 
importance of other components. This issue is analogous to the product mix problem in management accounting where 
а company has a low-margin product (LM) and a high-margin product (HM). Assume that HM volume grows 10 percent 
and LM volume doubles, shifting the mix away from HM and toward LM and reducing overall profit margin. In this 
case it is logical to attribute the decline in profit margin to LM, the product whose growth was extreme. Similarly, in our 
case, we attribute the weight effect to special items, the component whose growth was extreme. In an untabulated 
analysis we find that more than 90 percent of the change in the coefficient estimate on aggregate expense in Equation 
(1) is due to the combined effects of the change in the coefficient estimate and the change in the weight of special items | 
(details available from the authors on request). 
Specifically, we re-estimate Equation (1) after deleting all observations with a large special item in the current year 
(5,330 observations), we re-estimate the volatility measures after deleting observations with a large special item in any 
of the prior five years (12,145 observations), and we re-estimate the persistence regression after deleting observations 
with large special items in the current or previous years (7,728 observations). In addition, we repeat these analyses with 
a smaller, common sample formed by the intersection of these three subsamples, and results are very similar. Our 
definition of “large” is consistent with that used by Elliott and Hanna (1996). 
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TABLE 3 
Properties of Earnings-Related Variables over Time 
Adjusting for Large Special Items 


Panel A: Early versus Late Period, Excluding Observations with Large Special Items 


Ехр,_1 Ехр, Ехрил 
1967-1985 (No Large 51) -0.003 1.042 ~ 0.036 
1986—2005 (No Large 51) 0.010 1.046 -0.046 
Difference 0.013* 0.004 —0.010* 
t-statistic (1.97) (0.42) ` (—1Л3) 

Volatility Persistence 

1967-1985 (No Large 50) 0.020 С.822 
1986-2005 (No Large SJ) 0.022 0.819 
Difference 0.002 ** —0.003 
t-statistic (3.11) (-0.15) 


Panel B: Late Period, With versus Without Large Special Items 


EPa. Br _ DEB | 
1986-2005 (No Large 51) 0.010 1.046 —0.046 
1986-2005 (Large 51) 0.165 0.752 0.076 
Difference 0.1554 ** —0.294*** 0.123*** 
t-statistic - (8.10) (-8.44) (5.53) 
Volatility Persistence 
1986-2005 (No Large 51) 0.022 0.819 
1986-2005 (Large 51) 0.032 0.529 
Difference 0.010*** —0.290*** 
t-statistic (15.94) (-1143 


Panel С: Early versus Late Period, Based on Operating Income 


EPa 2 uuu. Ls EN ee 
1967-1985 -0.012 . 1.067 —0.046 
1986-2005 0.011 1.059 —0.058 
Difference 0.024** —0.008 —0.012 
t-statistic (2.85) (-0.63) (- 1.35) 
Volatility Persistence 
1967-1985 0.035 0.819 
1986-2005 0.034 0.813 
Difference -0.001 -0.006 
t-statistic (-0.50) (-0.15) 
(continued on next page) 
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Panel D: Early versus Late, Adding Back Special Items 


Ege _ EP EPa 
1967-1985 —0.003 1.038 —0.031 
1986-2005 0.023 1.024 —0.038 
Difference 0.025*** —0.013* —0.007 
t-statistic (3.99) (—1.70) (—1.09) 
Volatility Persistence 
1967-1985 0.021 0.822 
1986-2005 0.027 0.786 
Difference 0.096% ** —0.036* 
t-statistic (6.56) (—1.87) 


* ЖЖ ЖЖЖ Represent significance at the 10 percent, 5 percent, and 1 percent levels, respectively, in two-tailed tests. 
For difference calculations only, t-statistics are in parentheses below the coefficient estimates. 
This table presents properties of earnings-related variables for two time periods, 1967-1985 and 1986-2005, utilizing four 
different methods to control for the presence of special items. All regression specifications are equivalent to those estimated 
in Table 1 except as noted below. 
In each panel, the first set of results presents average annual coefficients from estimating: 

Model 1: Rev, = у) + s Expi + ЊЕХРр + EXPL Aa + В. 
In each panel, the second set of results presents average firm-specific volatility of Earn and average coefficients (persis- 
tence) from estimating: 

Model 2: Earn, = y+ y Earn; + 81. 
Panel A presents results for the early and late time periods after excluding firm-years with large special items (defined as 
greater than 1 percent of average total assets). 
Panel B presents results for the late time period, comparing observations with large special items to observations without 
large special items. 


Panel C presents results for the early and late time periods based on operating income after depreciation as the earnings 
measure. 


Panel D presents results for the early and late time periods after adding back special items to earnings before discontinued 
operations. 
Variables are defined in the Appendix. 


Our third and fourth analyses avoid the deletion of observations and use alternative earnings 
measures to evaluate the effect of special items. In the third analysis, we replace net income before 
extraordinary items with operating income after depreciation (but before special items) and we 
replace total expense with revenue minus operating income after depreciation. In the fourth analy- 
sis, we add special items back to net income before extraordinary items and replace total expense 
with revenue minus this modified income figure. 

The results for the third (fourth) analysis are reported in Panel C (D) of Table 3. In Panel C 
(D), the change in the coefficient estimate for current-period expense from Equation (1) from the 
early to the late period is —0.008 (—0.013), as compared to —0.137 in Table 1. The difference in 
Panel C is 6 percent of the original difference and is not significantly different from 0 (p = 0.52), 
while the difference in Panel D is 10 percent of the original difference and is weakly significant (p 
0.10). Also, in Panel C, the changes in both volatility and persistence from the early to the late 
period are small when compared to the results in Table 1, and neither change is statistically 
significant (p — 0.61 and 0.59, respectively). In Panel D, when special items are added back to 
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earnings, the increase (decrease) in volatility (persistence) is 43 percent (29 percent) of the in- 
crease (decrease) reported in Table 1, and both changes are significant (р < 0.01; p = 0.07)? 

We also observe that adjusting for special items in the current period affects the coefficient 
estimates on aggregate expense in the surrounding periods. Specifically, the increases in the 
coefficient estimates on non-contemporaneous expenses are attenuated or reversed. In Table 1, 
these coefficient estimates likely increased because the increased incidence of special items added 
noise to the contemporaneous relation between revenue and total current expense over time. This 
noise was likely compensated for by an increase in the relation between current revenue and 
non-current expense because (1) these expenses are highly correlated with prior period and next 
period revenue, and (2) revenue is highly persistent from one period to the next. Thus, all of the 
patterns we observed in Table 1 are substantially reduced by each of our adjustments for special 
items in Table 3. 

Taken together, these results suggest that the significant patterns documented by DT and 
reported in Table 1 are due primarily to the increase in the weight of special items as a component 
of total expense. In the next sections, we provide further evidence on the increase in special items 
over time and explore explanations for this phenomenon. 


ГУ. CHANGES OVER TIME IN THE INCIDENCE OF SPECIAL ITEMS 

To provide a general view of the extent to which special items are increasing during the 
sample period, we first plot the percent of sample observations that reported a large special item 
for each year from 1967 to 2005 in Figure 1. The incidence of large special items is below 5 
percent before 1973 and rises to almost 50 percent in 2001. This pattern is consistent with Elliott 
and Hanna (1996, Figure 1) and Fairfield et al. (2009, Figure 1). 

Next, we provide additional descriptive evidence on changes in the frequency of commonly 
reported special items using data reported by Accounting Trends and Techniques, an annual pub- 
lication of the American Institute of Certified Public Accountants. Each volume contains examples 
of corporate disclosures and tabulations of reporting practices for 600 industrial, retail, and service 
companies chosen from the Fortune 1000. Throughout most of our study period, Accounting 
Trends and Techniques has included a tabulation of various types of other revenue and gains and 
other expenses and losses disclosed by firms. We summarize these tabulations in Table 4, Panel A 
for gains and in Panel B for losses. We tabulate only items that the Compustat manual includes in 
its definition of special items. Each cell reports the percent of the 600 firms surveyed by Account- 
ing Trends and Techniques reporting that item that year. The columns sum to more than 100 
percent because a company may have more than one type of gain or loss (e.g., sale of assets and 
litigation settlement), and a gain or loss may count in more than one category (e.g., write-down of 
assets and restructuring of operations)." 

The results in Table 4, Panel A, indicate a steady increase in gain special items between 1975 
and 2005, with average total incidence for the first (last) three years of 9 percent (54 percent), 


? The tests reported in Panel D of Table 3 are similar to a robustness test conducted by DT. DT report that the original 
change in the coefficient estimate on contemporaneous expense from Equation (1) of —0.149 falls to —0.081 when 
special items are added back to income. We find that our original change of —0.137 falls to —0.013 after adding back 
special items, which is consistent with our decomposition results in Table 2 and our results from the additional 
adjustments for special items in Table 3. 

10 There is a substantial overlap between this group and our sample. For example, for 2002, of the 750 firms in our sample, 
350 (47 percent) were also included in the firms surveyed by Accounting Trends and Techniques. 

H Compustat and Accounting Trends and Techniques use similar procedures to identify special items. In both cases, items 
are identified as a special item when they are disclosed either in the main financial statements or in the financial 
statement notes. See Burgstahler et al. (2002) for discussion of Compustat procedures. 
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FIGURE 1 
Large Special Items over Time 
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This figure graphs the incidence of large special items (those greater than 1 percent of average total assets) рег 
year in the sample. 


which is about a sixfold increase. Among the individual gain special items, only asset sales have 
a consistently high frequency, rising from 8 percent in 1975 to 37 percent in 2005. 

The incidence of loss special items (Panel B) is rising over time more rapidly than the 
incidence of gain special items. The most common loss special items are asset write-downs, 
restructuring charges, and losses on asset sales. Across all loss special items, the average total 
incidence for the first (last) three years reported in the table is 5 percent (126 percent), which is 
more than a 25-fold increase. Even if we eliminate write-downs and asset sales as being included 
in restructurings in later years, loss special items still increase 16-fold. Loss special items are 
increasing faster than gain special items both because the types of items being reported are 
increasing and because of increases in the frequency of particular items being reported. For 
example, only two types of loss special items are reported in Accounting Trends and Techniques 
for 1980, while for 2005 eight types of loss special items have reported amounts. Also, in 1980 
only 3 percent of firms report an asset write-down, but this percentage increases to 35 percent in 
2005. 
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V. ALTERNATIVE EXPLANATIONS FOR THE INCREASING FREQUENCY OF 
SPECIAL ITEMS 

In general there are three potential reasons for the increase in special items over the past 
several decades. First, changes in economic activity could result in firms recording more special 
items than before. For example, it could be more common in the 2000s than in the 1960s for firms 
to experience impaired assets, sell an asset before it is used up, or lay off employees. Such changes 
in economic activity can lead to an increase in special items even when there has been no change 
in formal accounting standards. Second, new accounting standards could either require firms to 
record a special item that was previously ignored or provide more specific guidance that effec- 
tively encourages firms to record a special item more frequently than before. Below, we discuss 
specific accounting standards that relate to special items. Third, changes in accounting and dis- 
closure practice can lead to firms recording more special items than previously even when eco- 
nomic events and formal accounting standards are not changing. For example, firms may begin to 
more actively write down assets that have been impaired rather than continuing to depreciate them 
according to the original depreciation schedule.'? Such changes in accounting practice could occur 
if the financial reporting incentives for the firm or manager change. However, these changes are 
difficult to disentangle from changes in accounting practice that are precipitated by changes in 
economic activity or formal accounting standards. 

Despite a significant amount of research on special items, there is little evidence on which of 
these three potential reasons for the increase in special items is most important. Indeed, Frankel 
(2009, 239) states that “we do not understand why the use of special items has increased.” In this 
section, we provide evidence on these potential reasons for the increase in special items in recent 
decades. Because changes in accounting practice are difficult to separate from the other two 
reasons, we focus on economic activity and accounting standards as two alternative, non-exclusive 
explanations for the increase in special items and the resulting effect on the properties of account- 
ing earnings. 

DT also consider accounting standards and economic explanations for their findings by testing 
whether their main results obtain for both accrual and cash flow expense measures. They find that 
the changes in the properties of earnings are attributable primarily to accruals and conclude that 
“accounting-based factors are a substantial and perhaps even primary determinant of the docu- 
mented temporal patterns of earnings characteristics, while real-economy changes play a second- 
ary role" (DT 2008, 1453).'* Our finding that special items are responsible for the change in 
earnings properties is consistent with DT’s finding with respect to accruals because special items 
are almost exclusively accruals (Dechow and Ge 2006). However, it is not clear ex ante whether 
more frequent accruals for restructuring charges, impairments, and asset sales are driven predomi- 
nantly by new accounting standards or by economic factors. 


12 Alternatively, firms may begin to more transparently disclose write-downs that they were already recording but not 
separately disclosing. We note, however, that more transparent disclosure of a constant frequency and magnitude of 
special items would not explain the changes from the early to the late period reported in Table 1. 

13 Recent research by McVay (2006) and Fan et al. (2010) provides evidence that managers opportunistically manage 

recurring firm performance by including recurring charges as one-time special items. However, neither of these studies 

provides evidence on whether such classification shifting is either increasing over time or affects the frequency of 
special items. Moreover, such classification shifting, which results in special items that are more persistent, is likely to 

partially mitigate the disruptive effect of special items on the revenue-expense relation documented in Tables 2 and 3. 

Thus, while classification shifting is an important issue in accounting for special items, it is not likely to contribute to 

the role of special items in the revenue-expense relation. 

DT also examine whether changes in industry composition in their sample drive their results as another potential 

economic explanation. We find that special items are increasing across virtually all industries (untabulated). 
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The Role of Economic Events 


In this subsection we examine whether an increase in economic activity associated with 
special items has contributed to the increasing frequency of special items. To examine the role of 
changes in economic activity, we construct an index that includes specific economic events that 
cannot arise from bookkeeping practices alone, but are frequently associated with firms recording 
special items. We then examine whether this index is associated with the frequency of special 
items across firms and over time. 

We begin by identifying five firm-year indicator variables that represent real economic events 
that are likely to be associated with the most common special items in Table 4. Our first measure 
(Emp) takes the value of 1 if the firm has experienced negative employee growth that year, and 0 
otherwise. Negative employee growth is often the result of contracting business, and can be the 
direct result of restructuring layoffs that result in a special item. Our second and third measures are 
merger and acquisition activity (M&A) and discontinued operations (Disc). These variables reflect 
underlying growth or contraction through firm entry and exit in markets. Firms that enter or exit 
new markets or product lines are also likely to restructure operations or sell or write off assets, so 
we expect M&A and Disc to be related to the incidence of special items. The fourth and fifth 
measures indicate if the firm has an operating loss (Loss) or if the firm has negative sales growth 
(Grow). Firms that experience an operating loss or declining sales often write off assets or re- 
structure their businesses to become more competitive, so these variables are also expected to be 
associated with the probability of recording a special item. 

To provide evidence that these economic event variables are linked to the incidence of special 
items, we combine them into a single measure and examine the relationship of this measure to the 
incidence of special items. We give each observation a point 1f the observation has a 1 for each 
indicator variable, and then sum the points across the five economic event variables for each 
firm-year observation and label that sum E-Score. Thus, firm-year observations with an E-Score of 
0 (5) have a 0 (1) for each indicator variable, indicating a low (high) level of economic activity 
associated with special items. 

To successfully capture economic events associated with the increasing incidence of special 
items, this index should increase over time and should also be associated with the cross-sectional 
probability of recording a special item. To examine whether the index is increasing over time, we 
calculate the average E-Score each year during the study period and then average the annual 
averages for four sub-periods, 1967-1974, 1975-1984, 1985-1996, and 1997-2005. The results 
are reported in the first row of Table 5, Panel A. The average E-Score increases steadily from 0.73 
for the first sub-period to 1.15 for the last sub-period, an increase of nearly 60 percent. The 
correlation between average E-Score and a time trend is 0.55 (p < 0.01). 

To provide further evidence that E-Score is increasing over time, we divide the sample each 
year into three groups on the basis of E-Score. The "high" group consists of observations with 
E-Score equal to 3, 4, or 5, the “middle” group consists of observations with E-Score equal to 1 or 
2, and the “low” group consists of observations with E-Score equal to 0. The next two rows in 
Panel A report that the middle (high) E-Score group has 50 (4) percent of the observations in the 
first sub-period, rising to 58 (11) percent of the observations in the last sub-period, confirming the 
increase in E-Score over time. 

The results for each year are presented graphically in Figure 2, along with fitted trend lines, 
which indicate that the incidence of large special items for the high E-Score group is much higher 
than for the low group. Further, the incidence for the high group is growing at more than twice the 
rate for the low group. The trend line slope for the high group is 0.017 (i.e., an average increase 


15 We also observe statistically significant (at least р < 0.10) increases over time for all five components of E-Score. 
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TABLE 5 
E-Score Classifications and Special Items 


Panel A: E-Score over Time 


Mean Mean Mean Mean Time 
67-74 75-84 85-96 97-05 Trend p-value 











Level of E-Score 0.73 0.90 1.03 1.15 0.55 <0.01 
Incidence of Middle E-Score Observations 0.50 0.52 0.54 0.58 0.49 <0.01 
Incidence of High E-Score Observations 0.04 0.07 0.10 0.11 0.54 «0.01 
Panel B: Cross-Sectional Relation between E-Score and Special Item Incidence 
Low Middle High 

E-Score E-Score E-Score Diff 
1967-1974 0.7 1.9 5.1 44 
1975—1984 5.0 10.8 20.1 15.1 
1985-1996 13.4 26.6 447 31.2 
1997—2005 22.6 38.0 55.7 33.2 
Diff 21.9 36.1 50.7 


We rank each firm-year observation on the incidence of five economic event variables from 1967 to 2005: negative 
employee growth, M&A, discontinued operations, negative revenue growth, operating losses. We give a firm-year obser- 
vation one point for each indicator variable coded as 1. We then sum the points across the five variables for each firm-year 
observation, and label that sum E-Score. Thus, across observations, the E-Score can conceivably vary from 0 points 
(indicating the presence of no economic events) to 5 points (indicating the presence of all economic events). We then 
divide the sample each year into three groups: 0 points for the "Low" group (roughly bottom 25 percent of sample), 1 to 
2 points for the "Middle" Group, and 3-5 points for the "High" group (roughly top 10 percent of sample). 

In Panel A, we plot mean values of E-Score over four sub-periods between 1967—2005, along with a correlation with this 
time-series and a trend variable. 


In Panel B, we compute the incidence of large special items for each of the three groups each year and then average over 
the four sub-periods. Bolded differences are significant at the 1 percent level (two-sided). 


in the incidence of special items of 1.7 percent per year), compared with a slope of 0.007 for the 
low group, and the difference is significant at p « 0.01 (untabulated). This simple analysis 
provides strong evidence that our index of economic events is increasing over time and that it 
captures cross-sectional variation in the incidence of special items. 

To provide further evidence that E-Score is increasing over time and is associated with the 
cross-sectional probability of recording a special item, we compute the average incidence of 
special items for the three E-Score groups for each of four sub-periods. The results, reported in 
Table 5, Panel B, reveal two clear patterns. First, the rows indicate that the average incidence of 
large special items for the high E-Score firms is more than double the incidence for the low 
E-Score firms in all four sub-periods. For example, in the most recent period, from 1997 to 2005, 
the incidence of large special items for firms in the high E-Score group is 55.7 percent, as 
compared to only 22.6 percent for the firms in the low E-Score group. These patterns indicate that 
E-Score is positively associated with the incidence of special items. 15 





15 To document that variation in E-Score is directly related to deterioration in the revenue-expense relation, we also 
re-estimate Equation (1) each year separately for each of the three E-Score groups. In untabulated results we find that the 
coefficient estimate on contemporaneous expense declines overall as E-Score increases: 0.96, 0.98, and 0.84 for the low, 
middle and high E-Score groups, respectively. 
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FIGURE 2 
Firm-Specific Economic Variables and Special Items 





...... LOW 


— Medium 


== ww High 








This figure graphs the incidence of special items for firms based on E-Score by year. E-Score is defined in Table 
5, Firms in the “High” group have a score of 3 or higher, firms in the “Medium” group have a score of 1 or 2, 
while firms in the "Low" group have a score of 0. Scores are calculated by ranking each firm-year observation 
on each of the five economic variables in Table 5 from 1967 to 2005, and giving the observation one point for 
each indicator variable. We then sum the points across the five variables for each firm-year observation. 


Second, the columns indicate that the incidence of special items increases through time for all 
three groups, and the differences between the incidence for the early sub-period and the late 
sub-period are significant at the 0.01 level for all three groups (last row). Thus, special items 
become more common even for relatively fixed levels of our economic activity index, indicating 
that E-Score does not capture all determinants of special items. This is reinforced by the variation 
around the fitted lines in Figure 2. One explanation for this finding is that changes in economic 
activity not captured by the five simple variables comprising E-Score are responsible for increases 
in special items. An alternative explanation is that, holding economic events related to special 
items constant, other factors, such as new accounting standards, led firms to record more special 
items. We examine this explanation in the next subsection. 


The Role of Accounting Standards 


To examine the role of accounting standards, we identify the specific standards that were 
adopted during our study period that are most likely to have had an effect on the frequency of 
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special items. We then examine the frequency of special items both in the years (1) prior to 
implementation of these standards (when they had no effect,) and (2) after implementation of these 
standards (when the effect, if any, should take place). 

The general accounting standard that applied to special items for most of the study period is 
APB Opinion No. 30 (AICPA 1973) on reporting the results of operations, which became effective 
in 1973. This standard does not use the term “special items” but rather paragraph 26 states that “A 
material event or transaction that is unusual in nature or occurs infrequently but not both ... should 
be reported as a separate component of income from continuing operations. The nature and 
financial effects of each event or transaction should be disclosed on the face of the income 
statement or, alternatively, in notes to the financial statements.” The results in Table 4 suggest that 
five items account for the majority of both the increase and the level of these separately disclosed 
income components: asset write-downs, sales of assets, restructurings, and, toward the end of the 
period, intangibles impairment and debt extinguishment. We examine specific accounting stan- 
dards that were adopted during the study period that apply to each of these items. 

Our search reveals that the earliest accounting standard related to special items adopted during 
our study period is Staff Accounting Bulletin 67 on restructurings, which was issued by the SEC 
in 1986 and is discussed in more detail below. However, Figure 1 suggests that the frequency of 
special items was increasing prior to 1986. To evaluate whether this is the case, we estimate the 
following time-series regression, with one observation per year from 1975 to 1985: 


%SI,= у + у Тгепа, + y,E-ScoreAvg,+ Ep, (4) 


where 9651, is the percent of firms reporting a special item in year г, and Trend is a variable that 
goes from 1 (for 1975) to 11 (for 1985). E-ScoreAvg, is the average for E-Score (defined earlier) 
across firms for year t. We estimate Equation (4) using Accounting Trends and Techniques data." 
We compute 9657 as the sum of the percent of gain special items and loss special items reported in 
Panels A and B of Table 4 (e.g., for 1983 we use 27 + 26 = 53 percent). As the first column of 
Table 6 demonstrates, the estimated coefficient for Trend is 0.012 (t = 7.67; p < 0.01) and the 
estimated coefficient for E-ScoreAvg is 0.032 (t = 2.08; p = 0.071). These results indicate a 
pronounced increase in total special items during the 11-year period prior to the first accounting 
standard related to special items, as well as some evidence of a role for our economic event proxy, 
E-Score. 


Asset Write-Downs 


In 1984, the EITF acknowledged that the frequency of asset write-downs had been increasing 
over time, despite no formal guidance in GAAP for their recognition (see EITF 84-28, in which no 
consensus on a standard was reached). The first standard to explicitly address asset write-downs 
was SFAS 121, effective in 1996, with early adoption in 1995 "encouraged." This standard was 
superseded by SFAS 144 in 2002, but virtually all of the asset impairment rules were left intact. In 
Table 4, Panel A, asset write-downs increase from 11 percent in 1994 to 19 percent in 1996, but 
we observe an upward trend both before and after that interval, and larger single year increases in 
1998 and 2001. Further, we note that SFAS 121 explicitly cites the widespread reporting of asset 
impairments without specific authoritative guidance as motivation for the standard (paragraph 5), 
suggesting this standard was at least partly reactive in nature (see Francis et al. 1996, 117-118). 

To evaluate whether SFAS 121 increased asset write-downs, we use data from Accounting 


17 Results are very similar and inferences are unchanged using Compustat data for this test. We report results utilizing the 
Accounting Trends and Techniques data for consistency throughout the tests reported in Table 6. 
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TABLE 6 
Regression Analysis 
Special Item Incidence and Accounting Standards 
Model 4 Model 5 Model 6 Model 7 
Intercept —0.090*** —0.110*** 0.003 —0.352*** 
(—4.69) (-3.44) (0.08) (—6.05) 
Trend 0.012*** 0.006*+* “ 0.007%** 0.015*** 
(7.67) (3.98) (5.92) (5.66) 
E-ScoreAvg 0.032* 0.069" + 0.101** 0.201*** 
(2.08) (2.29) (2.29) (5.19) 
D86 -0.019 
(-0.68) 
D95 0.118*** -0.037 
(4.78) (-1.19) 
Adj. R? 0.914 0.910 0.688 0.887 
п 11 30 30 26 


Ж жж *** Represent significance at the 10 percent, 5 percent, and 1 percent levels, respectively, in one-tailed tests 
(positive coefficients expected), except the intercept. 

t-statistics are in parentheses below the coefficient estimates. 

This table presents results from estimations of the following models: 


Model 4: WSI, = у + у Trend, + ^4, E-ScoreAvg, + в,. 
Model 5: %WD,= yg + Trend, + у 095, + 4E-ScoreAvg, + 8,. 
Model 6: %SA,= W + у Тгепа, + 4E-ScoreAvg, + е. 


Model 7: %Rest, = yy + y Trend, + 086, + 4D95, + y,E-ScoreAug, + £,. 
The models are OLS time-series regressions with 11-30 annual observations from 1975-2005, as noted under the model 


number, depending on data availability from Accounting Trends and Techniques, summarized in Table 4. 
E-ScoreAvg is annual average of the E-Score variable defined in Table 5. All main variables are defined in the Appendix. 


Trends and Techniques in Table 4 to estimate the following time-series regression, with one 
observation each year from 1976 to 2005: 


HWD,= Yo + у Trend, + у, 095, уз Е-5сотеАив, + ёр, (5) 


where %WD, is the percent of firms surveyed by Accounting Trends and Techniques that reported 
an asset write-down in year t, Trend takes values from 1 (for 1976) to 30 (for 2005), and D95 is 
an indicator variable that takes the value of 1 if the year is 1995 or later, and 0 otherwise. 
E-ScoreAvg, is as defined earlier. 

The second column of Table 6 reports estimates from Equation (5). Consistent with the 
anecdotal observations in EITF 84-28 and SFAS 121, there is evidence of a strong independent 
time trend in the incidence of asset write-downs (t = 3.98; p « 0.01). There is also evidence that 
our index of economic events helps explain the incidence of write-down activity over time (t — 
2.29; p -- 0.03). Finally, the coefficient for D95 is 0.118 (t — 4.78; p « 0.01), indicating that the 
incidence of asset write-downs increased about 12 percentage points in the years following SFAS 
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121. Thus, it appears that SFAS 121 likely encouraged or legitimized asset write-down activity. 


Sale of Assets 

Table 4 reports a slow increase over time for this item for both gains and losses. However, 
because no specific standard applies to asset sales, this contribution does not appear to be due to 
changes in accounting standards. To evaluate whether the frequency of asset sales is increasing 
even though it is not affected by a new accounting standard, we estimate the following time-series 
regression, with one observation per year from 1975 to 2005: 


968A, = Yo у Trend, + yE-ScoreAvg, + £, (6) 


where %SA, is the percent of firms surveyed by Accounting Trends and Techniques that reported 
a gain or loss from asset sales in year 7, and Trend is a variable that goes from 1 (for 1975) to 31 
(for 2005). E-ScoreAvg, is as defined earlier. The third column in Table 6 indicates that the 
estimated coefficient for Trend is 0.007 (t = 5.92; p < 0.01), consistent with the frequency of 
gains and losses from asset sales growing significantly during this period when no accounting 
standards related to asset sales were adopted. In addition, the estimated coefficient for E-ScoreAvg 
is 0.101 (t — 2.29; p — 0.03), consistent with economic events playing a role in the increase in 
asset sales. 


Restructuring of Operations 

The first standard to address this issue is SEC Staff Accounting Bulletin 67, issued in 1986, 
which states that restructurings should be considered part of operations and disclosed separately if 
material. Later, EITF 94-3 (adopted in 1995) provides guidance on when a restructuring liability 
can be recorded. This guidance was revised by SFAS 146 in 2002 (effective in 2003). Restructur- 
ings were first reported in Accounting Trends and Techniques in 1980 when the frequency was 8 
percent. Six years later, the SEC reacted to this new disclosure by issuing SAB 67. Overall, there 
is a steady increase in the frequency of restructurings throughout the period with spikes in 1986, 
1993, 2001, and 2002. One of these is coincident with SAB 67, which may have had the effect of 
legitimizing this type of disclosure. The spike in 1993 could have resulted from the economic 
downturn at that time, and may have precipitated the Emerging Issues Task Force to consider the 
issue in 1994. The additional spikes in 2001 and 2002 are likely the result of 9/11 and the bursting 
of the technology stock bubble. 

To evaluate whether tbe frequency of special items was affected by these changes in account- 
ing standards, we estimate the following time-series regression, with one observation per year 
from 1980 to 2005: 


9b Rest, = Yo + y Trend, + у 086, + y,D95, + y,E-ScoreAug, + 8 (7) 


where %Rest, is the percent of firms surveyed by Accounting Trends and Techniques that reported 
a restructuring loss in year 1, Trend is a variable that goes from 1 (for 1980) to 26 (for 2005), and 
D86 (D95) is an indicator variable that takes the value of 1 for all years from 1986 (1995) and 
later, and 0 (0) otherwise. Thus, D86 captures the incremental increase in the incidence of restruc- 
turing activity from 1986 to 1994 (after SAB 67), and D95 captures the incremental shift in 1995 
and thereafter (after EITF 94-3). E-ScoreAvg, is as defined earlier. 

In the fourth column of Table 6, the estimated coefficient for Trend is 0.015 (t = 5.66; p « 
0.01), indicating a significant increase in the frequency of restructurings. The estimated coeffi- 
cients on 1286 and D95 are insignificant, indicating no increase in restructurings after these ac- 
counting standards. Finally, the estimated coefficient of 0.201 for E-ScoreAvg is significant (t — 
5.19; p « 0.01), consistent with economic activity explaining some of the increasing incidence of 
restructurings. 
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Impairment of Intangibles and Debt Extinguishment 

Intangible assets were subject to SFAS 121, but Accounting Trends and Techniques did not 
begin reporting this item until 2003, after the adoption of SFAS 142. This standard applies 
specifically to intangible assets and took effect in 2002. SFAS 145, which took effect in 2003 for 
calendar-year firms, applies to debt extinguishment transactions. This item first appeared in Ac- 
counting Trends and Techniques in 2003 at the 10 percent frequency level, and has maintained 
about that level since. We do not conduct tests for these items because SFAS 142 and SFAS 145 
were adopted too recently to explain the nearly 30 years of increasing frequency of special items 
prior to 2002. 

The results summarized above indicate that the incidence of total special items was increasing 
for some years prior to the adoption of a specific accounting standard addressing special items. We 
also find that gains and losses from asset sales were increasing throughout the study period even 
though no new accounting standard affected these transactions. However, there is evidence that a 
new accounting standard for asset write-downs (SFAS 121) could have contributed to the increas- 
ing frequency of these special items. Importantly, our proxy for economic events (E-ScoreAvg) is 
significant in each test." Thus, in general, we observe a stronger and more sustained role for 
changes in economic activity as an explanation for the increasing frequency of special items and 
a narrower role for changes in specific accounting standards. 

To combine the above analyses into one test, we use Compustat data to compare the relative 
contributions of specific accounting standards and the economic activity represented by E-Score to 
the increase in the incidence of total special items over time. We estimate the following pooled 
time-series and cross-sectional regression at the firm-year level for 1967-2005: 


51, = Yo + у D86i, , + уг 095, + y3D96,  - y4D02; t y5D03; ,  ysE-Score,, + yyTrend; + 8 
(8) 


In Equation (8), SI; is an indicator variable equal to 1 if firm i in year t reported a large special 
item, and 0 otherwise. D86, 095, and D96 are defined as in Equations (4){7) and 202 and роз 
are defined analogously (2002 for SFAS 142 and 2003 for SFAS 145 and SFAS 146). Thus, these 
indicator variables are defined such that successive variables indicate the incremental frequency of 
special items until the next indicator variable is turned on. For example, the coefficient estimate on 
D96 represents the incremental frequency for the period 1996 to 2001 (prior to 202), relative to 
the period prior to 1996. E-Scare and Trend are as defined above. 9 

The results for our estimation of Equation (8) are presented in the first column of Table 7. 
None of the coefficient estimates for the accounting standard indicator variables are positive. In 
fact, several are significantly negative.” Thus, once the time trend and E-Score are controlled, 
these results provide no evidence that the increase in total special items is due to the implemen- 
tation of these accounting standards. In contrast, the coefficient estimate on E-Score is positive 
(0.512) and highly significant (t = 24.87; p < 0.01). The coefficient estimate on Trend is also 
positive (0.127) and highly significant (t = 7.98; p < 0.01), indicating that, even after controlling 
for E-Score, the incidence of special items is increasing substantially during the study period. 

As an alternative analysis, we also estimate the following time-series regression with one 
observation for each year from 1967-2005: 


18 Inferences with respect to accounting standards and the time trend are unchanged if E-ScoreAvg is omitted. 

19 We are grateful to an anonymous reviewer for suggesting the analysis in models 8 and 9. 

Ж The coefficient estimates for 202 and ДОЗ are significantly negative, indicating that special items fell significantly from 
the 2001 level (likely due to 9/11), The coefficient estimate on 095 is also significantly negative. 
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TABLE 7 
Regression Analysis 
Accounting Standards and Economic Events 
Model 8 Model 9 
Intercept —4,583*** —0.073*** 
(—20.12) (—4.06) 
D86 —0.173 0.029 
(—1.03) (1.51) 
D95 —0.249** 0.005 
(-2.25) (0.17) 
D96 -0.129 0.046 
(—1.64) (1.51) 
D02 —0.669*** —0.062* 
(-8.25) (-2.03) 
роз —0.321*** —0.032 
. (—4.93) (—0.97) 
E-Score 0.512*** 
(24.87) 
E-ScoreAvg 0.076*** 
(3.53) 
Trend 0.127*** 0.008*** 
(7.98) (7.03) 
Pseudo/Adj. R? 0.147 0.954 


Ж жж F** Represent significance at the 10 percent, 5 percent, and 1 percent levels, respectively, in one-tailed tests 


(positive coefficients expected), except the intercept. 
t-statistics are in parentheses below the coefficient estimates and are based on standard errors clustered by firm and year. 


This table presents results from estimations of the following models: 
Model 8: 51, = у) + 4D86,, + 72095, + ур96,, + y4D02; 4+ y5D03, , + YeE-Score,, + yy Trendy, + ё. 


Model 9: 951, w + у 086, + %095, + 4D96, + у,р02, + у;003, + увЕ-5соге, + учТгепа, + в,. 


Model 8 is a pooled logistic regression at the firm-year level where 57 is an indicator variable taking the value of 1 if a 
large special item is reported, 0 otherwise. E-Score is the firm-year variable defined in Table 5. The time period is 
1967-2005. 


Model 9 is OLS time-series regression with 39 annual observations from 1967-2005. In model 9, E-ScoreAvg is the annual 
average of E-Score. 


Other variables are defined in the Appendix. 


PSI = Yo + у 086, + у, 095, + 4D96, + y4D02,+ YsD03,+ ysE-ScoreAvg, + yTrend, + е, 
(9) 


In Equation (9), 9051, is the percent of observations with a special item greater than 1 percent of 
total assets for year f. The year indicator variables in Equation (9) are identical to those in 
Equation (8), while E-ScoreAvg and Trend are as defined earlier in Equations (4)-(7). 

The results from estimating Equation (9) are presented in the second column of Table 7. 
Consistent with the results for Equation (8), none of the coefficient estimates on the accounting 
standard indicator variables are significantly positive. This suggests that these accounting stan- 
dards did not have a significant effect on the incidence of total special items once the time trend 
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and E-ScoreAvg are controlled. The coefficient estimate on E-ScoreAvg is positive (0.076) and 
significant (t = 3.53; p < 0.01). This indicates that, after controlling for the time trend, the 
incidence of special items is 7.6 percent higher for every one-point increase in average E-Score.”! 

The results for Equations (8) and (9) provide no support that adoption of the specific account- 
ing standards we investigate contributed significantly to the increase in total special items. In 
contrast, our composite measure of economic events is significantly related to the incidence of 
special items over the study period.” 


VI. COMPETITIVE PRESSURE AND THE CHANGING ECONOMIC ENVIRONMENT 

The evidence presented in the previous sections suggests that the increasing incidence of 
special items in recent decades is attributable, at least in part, to changes in the frequency of 
economic events associated with recording special items. In this section, we examine whether the 
increase in this type of economic activity is related to increases in the level of competition in the 
U.S. (and global) economy over the last 40 years that are documented in previous research. 


Prior Research on Increases in Competition 

A growing literature provides evidence that the level of competition in the U.S. economy has 
increased over the last several decades due to changes in trade, technology, and regulation, among 
other factors. For example, Shepherd (1982) documents a large increase in the level of competition 
throughout the U.S. economy between 1958 and 1980, which he attributes primarily to antitrust 
policies, with additional contributions from import competition and deregulation. Pryor (1994) 
reports that U.S. concentration ratios, a common inverse indicator of competition used in the 
industrial organization literature, declined between 1958 and 1982 when adjusted for imports, 
indicating that imports have contributed to increasing competition in U.S. industries.” 

More recent studies by Comin and Philippon (2005), Gaspar and Massa (2006), and Irvine 
and Pontiff (2009) document increases in competition later in our study period that they link to 
increases in firm volatility or performance instability. In addition, Klein and Marquardt (2006) 
show that the well-documented increasing incidence of accounting losses is attributable mostly to 
nonaccounting factors such as real firm performance, consistent with an increase in competition 
over time. In the executive labor market, Huson et al. (2001) provide evidence that forced CEO 
turnover has increased in recent years, and DeFond and Park (1999) report that the frequency of 
CEO turnover is positively related to the extent of competition in the industry. 

Additional studies provide evidence that firms take actions commonly associated with record- 
ing special items in response to competitive pressure. For example, Bernard et al. (2006) report 
that import-related competition is negatively associated with plant survival and growth, suggesting 
an association between import competition and asset write-offs. Holmstrom and Kaplan (2001, 
121) argue that corporations now regularly transform themselves, through “large waves of merger, 
takeover and restructuring” that they attribute to “increased dominance of capital markets.” Har- 
ford (2005) provides evidence that merger waves between 1981 and 2000 were related to deregu- 





21 We also estimate a "changes" version of Equation (9), where the indicator variables for accounting standards include 
both the implementation year and the following year to capture incremental changes in incidence following the passage 
of various accounting standards. The change in the percentage of large special items is strongly related to the change in 
average E-Score (AE-ScoreAvg). The coefficient estimate for AE-ScoreAvg is 0.072 (t = 2.86), the coefficient for time 
trend is insignificant, and no accounting standard variable is positive and significant. 

22 We view Equations (8) and (9) as complementary. Equation (8) allows E-Score to explain cross-sectional variation in the 
incidence of special items. This may control for "noise" unrelated to new accounting standards, resulting in more 
powerful tests for the effects of new accounting standards, Alternatively, this may control some of the variation due to 
new accounting standards, resulting in weaker tests for the effects of accounting standards. 

23 Бог additional discussion of increasing competition and its possible drivers, see Blinder (2000), Whitman (1999), Reich 
(2007), and London (2004). 
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lation and competition among industry peers (see Table 2). Taken together, this research provides 
direct evidence that competition has increased in the U.S. economy and indirect evidence that this 
increase in competition is related to company actions that are associated with recognizing special 
items. 


Empirical Analysis 

In this subsection, we examine whether increases in competitive pressure can help explain 
increases in our index of economic events related to special item recognition. Our first measure of 
competition is the Herfindahl index, a standard measure in industrial organization (e.g., Gaspar 
and Massa 2006). To compute the Herfindahl Index (Herf) we use the Compustat universe to sum 
across each industry and year each firm’s squared market share, where market share is firm 
revenue divided by industry revenue. We supplement this standard measure of competition with 
Altman’s (1968) z-score (Bank), a common measure of bankruptcy risk or financial distress used 
in accounting and finance research. We view financial distress as a logical byproduct of increased 
competition where the economic environment is less forgiving of operational errors. We measure 
z-score at the industry level in the prior fiscal year to better reflect the competitive environment the 
firm faces rather than the firm’s individual financial distress. Measuring z-score at the industry 
level on a lagged basis also avoids a mechanical relation between components of E-Score and the 
accounting inputs used to calculate z-score.” 

We examine these two measures individually and in combination. To combine them, we 
compute a P-Score analogous to our E-Score by giving a point for each of the two competitive 
pressure variables above the median value for that variable based on the sample distribution from 
1967-1974, and summing the points for each observation. This sum varies from 0 points (below 
the median for both variables) to two points (above the median for both variables). 

If the economic activity captured by E-Score is associated with increasing competitive pres- 
sure, we expect our measures of competitive pressure to increase over time and to be correlated 
with E-Score in the cross-section. To examine the behavior of our pressure variables over time, we 
compute the annual average values for Herf, Bank, and P-Score for each of the four sub-periods 
used previously, as well as the correlation with a time trend. The results are reported in Table 8, 
Panel A. All three measures are increasing over time (becoming less negative for Herf and Bank) 
and all correlations with the time trend are positive and significant (p < 0.01). 

To provide further evidence on this increasing trend, we divide the sample into three groups 
on the basis of P-Score. The “high” group consists of observations with P-Score equal to 2, the 
“middle” group consists of observations with P-Score equal to 1, and the “low” group consists of 
observations with P-Score equal to 0. The last row of Panel A reports that the average incidence 
of observations in the high P-Score group increases from 19 percent in the first sub-period to 45 
percent in the last sub-period. Taken together, these results indicate that the annual averages for 
these pressure variables are increasing during our study period. 

To examine whether our measures of competitive pressure are associated with E-Score in the 
cross-section, we compute the average E-Score for the three P-Score groups for each of the four 
sub-periods. The results are reported in Table 8, Panel B. The clearest pattern is looking across the 
columns for each row, where we observe that the average E-Score is increasing across P-Score 
groups (i.e., increasing in competitive pressure) in all four sub-periods. These increases are sta- 
tistically significant for the last three sub-periods (p < 0.01). For example, in the period 1997 to 
2005 the average E-Score is 0.80 for the low P-Score group, as compared to 1.20 for the high 
P-Score group. 


?* For both Herf and Bank we assign observations to industries based on SIC codes at the two-digit level. 
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TABLE 8 
Levels of Competitive Pressure Variables over Time 


Panel A: Competitive Pressure Variables by Time Period and Correlation with Time Trend 


Mean Mean Mean Mean Time 
67-74 75-84 85-96 97-05 ‘Trend p-value 

















Level of Herf -007 -007 -006 -0205 0.70 «10.01 
Level of Bank -3.97 3.56 3.41 3.13 0.56 <0.01 
Level of P-Score 0.95 1.15 1.17 1.40 0.65 <0.01 
Incidence of High P-Score Observations 0.19 0.27 0.30 0.45 0.63 «0.01 
Panel B: Average E-Score for P-Score Groups 
Low Middle High 

P-Score P-Score P-Score Diff 
1967-1974 0.69 0.73 0.74 0.04 
1975-1984 0.78 0.89 0.96 0.18 
1985-1996 0.86 1.06 1.03 0.17 
1997—2005 0.80 1.14 120 0.40 
Diff 0.11 0.41 0.46 





We use two variables to proxy for competitive pressure, the Herfindahl index (Herf) and Altman's z-score (Bank). See the 
Appendix for variable definitions. We give each observation in any period one point for each value of Herf or Bank below 
the median value for these variables in the period 1967-1974, and then sum points across observations to form an index 
called P-Score. This index can range from 0 to 2 points. We label observations with a value of 0 (roughly bottom 10 
percent of the sample) as having a “Low” P-Score, observations with я value of 1 as having а “Middle” P-Score, and 
observations with 2 points as having a “High” P-Score (roughly top 25 percent of sample). 

Pane] A plots average levels of P-Score and its components over four sub-periods from 1967-2005, along with a correla- 
tion of these time-series with a trend variable. For Herf and Bank, we use cross-sectional medians each year due to the 
strong skewness of these continuous variables. We also tabulate the incidence of “High” P-Score observations over time. 


In Panel B, we calculate average values of E-Score across the three P-Score groups each sub-period. Bolded differences are 
significant at the 1 percent level (two-sided). 


Looking down the columns in Panel B of Table 8, the results also reveal a statistically 
significant increase in average E-Score over time for observations only in the middle and high 
P-Score groups. This indicates that economic events associated with special items have not in- 
creased among firms facing low levels of competitive pressure, but have increased for firms facing 
medium or high levels of economic pressure.” In addition, we re-estimate model 8 after adding 
P-Score. In this regression, P-Score is positive and significant, suggesting that competitive pres- 
sure contributes to the incidence of special items (untabulated). Overall, this evidence indicates 
that increases in economic events linked to special items are related to increasing competitive 
pressure in the economy. | 





25 In untabulated analysis, we also estimate annual regressions of E-Score first on the two economic pressure variables, and - 
then separately on P-Score. The averages for the estimated annual slope coefficients are positive and highly significant 
(p < 0.01) for both the components of P-Score and for P-Score itself. 
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VIL. CONCLUSION 

This study examines the reasons for the substantial decline in the contemporaneous associa- 
tion between revenue and expense over time, as documented by Dichev and Tang (2008). To 
isolate the source of this decline, we first disaggregate total expense into six components. We find 
that this decline (and the associated increase in earnings volatility and decrease in earnings per- 
sistence) is attributable primarily to a steady increase in the frequency of large special items. 

We examine two potential reasons for this increase in special items. The first is that changes 
in specific economic activity have led to more special items. The second is that changes in specific 
accounting standards require (or encourage) firms to réport more special items. We find little 
evidence that the specific accounting standards we study are a primary reason for the increase in 
special items. In contrast, we find much stronger evidence that changes in economic activity are a 
primary reason for the increase in special items. In particular, we find that an index of five 
indicators of specific economic events associated with recording special items has increased 
steadily over time and that high values for the index have more than twice the incidence of special 
items as low index values. 

Finally, we examine whether increasing competition in the U.S. economy is related to in- 
creases in our index of economic events. We identify two indicators of competitive pressure at the 
industry level, finding that, both individually and when combined, they are increasing over time 
and are correlated with our economic activity index in the cross-section. 

Our findings demonstrate the importance of disaggregating special items, and are important 
for standard-setters, given the frequency of these items. This is especially true in light of the recent 
discussion paper by the FASB and the IASB, Preliminary Views on Financial Statement Presen- 
tation (FASB 2008), which directly addresses this issue. The FASB’s preliminary view is that 
special items should be disclosed in a "memo" column of the proposed reconciliation between 
cash flows and comprehensive income (paras. 4.48 to 4.52), while the IASB does not support this 
additional disclosure (para. 4.53). 

While the evidence presented in this study suggests that changes in economic activity are a 
primary reason for the increase in special items and specific accounting standards contribute little 
to the increase in special items, our results do not prove the null hypothesis that accounting 
standards play no role in the increase in special items. The inability of our economic variables to 
fully explain the time trend in special items leaves open the possibility that accounting standards 
could explain some of this unexplained variation. Thus, it is possible that future research will 
develop stronger tests that are able to detect a role for accounting standards. 

We also leave two additional important issues for future research. The first is the possibility 
that changes in financial reporting practice that are not related to either new accounting standards 
or to changes in economic activity have contributed to the increase in special items over time. For 
example, perhaps managers facing increasing capital market pressure have changed their account- 
ing practices and are now more aggressively recognizing nonrecurring gains and losses, encour- 
aging investors to ignore these items for performance evaluation. This issue is beyond the scope of 
this study. Also beyond the scope of this study is whether performance measures that so frequently 
include special items are “good accounting.” While our evidence indicates that the increase in 
special items over time is linked to increases in economic events that affect firm welfare, we leave 
a more complete evaluation of the appropriateness of this accounting to future research. 
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APPENDIX 
DETAILED VARIABLE DEFINITIONS 
Earnings Variables and Components 


Revenue (Rev) = sales (data12)/average total assets (data6); 
Expense (Exp) = sales less income before extraordinary items (data18)/ 
average total assets (data6); 
Cost of Goods Sold (COGS) = data 41/average total assets (data6); 
Selling, General, and Admin (SGA) = data189/average total assets (data6); 
Depreciation (DEPR) — data 14/average total assets (data6), equals O if 
missing; 
Tax Expense (TAX) = datal6/average total assets (data6); 

Special Items (S = (—1) ж datal7/average total assets (data6) for 
decomposition analysis, equals 0 if missing. An indicator 
variable equal to 1 if the absolute value of special 
items is greater than or equal to 1 percent of average 
total assets (data6), 0 otherwise. Cross-sectional 
average measures annual incidence; 

Operating Income after Depreciation = datal78/average total assets (data6); 
Other Expenses (OTH) = total expense less COGS, SGA, DEPR, ТАХ, and SI; 
and 
%SI, = percent of sample firm-year observations with a special 
item greater than 1 percent of beginning total assets 
that year. 


Economic Event Variables 


Negative Employee Growth (Emp) = indicator variable that takes the value of 1 if the firm 
had negative employee (data29) growth for the year 
relative to the preceding year; 

Merger and Acquisition Activity (MA) — indicator variable that takes the value of 1 if the firm 
engaged in a merger or acquisition during the fiscal 
year (from AFTNT1); 

Discontinued Operations (Disc) — indicator variable that takes the value of 1 if the firm 
had discontinued operations for the year (data66); 
Operating Loss (Loss) — indicator variable that takes the value of 1 if the firm 
had negative operating income after depreciation 
(data178) for the year; 

Negative Revenue Growth (Grow) — indicator variable that takes the value of 1 if the firm 
had negative revenue (datal2) growth rate, 0 
otherwise; 

E-Score — sum of the five indicator variables Emp, MA, Disc, 
Loss, and Grow; and 
E-ScoreAvg = cross-sectional average of E-Score for a given year. 


Accounting Standard Variables 


Dxx = 1 in 19xx or 20xx and beyond, and 0 prior to 19xx or 20xx. 
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Competitive Pressure Variables 


Herfindahl Index (Herf) = sum for each industry-year of member firm's squared market 
share. Market share is firm revenue divided by industry 
revenue using all available Compustat data; 

Altman's Bankruptcy Prediction 
(Bank) = (—1) X the industry median z-score frcm Altman (1968) 

from the prior year, so that Bank is increasing in 
competition, varies from the lowest probability of bankruptcy 
(the lowest scores) to the highest probability of bankruptcy 
(the highest scores); and 

P-Score = sum of a point for each of Herf and Bank that are above the 
sample median from 1967-1974. 
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ABSTRACT: This study examines the relation between internal controls and conditional 
conservatism (“conservatism”), also referred to as timely loss recognition. Using a 
sample of firms that disclose material weaknesses (MWs) in internal controls under the 
Sarbanes-Oxley Act (SOX), we find a positive relation between intemal control quality 
and conservatism. Specifically, firms with MWs exhibit lower conservatism than firms 
without such weaknesses. Further, firms that disclose MWs and subsequently remedi- 
ate these weaknesses exhibit greater conservatism than firms that continue to have 
MWs. Overall, these results are consistent with strong Internal controls acting as a 
mechanism that facilitates conservatism. Our study contributes to the Шөгашгө on the 
reporting effects of strong versus weak intemal controls. 


Keywords: internal controls; conservatism; material weaknesses; Sarbanes-Oxley 
Act. 
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L INTRODUCTION 
n 2002, the U.S. Congress passed the Sarbanes-Oxley Act (SOX) to improve the quality of 
Е reporting and to restore investor confidence in the reliability of financial statements. 
An important aspect of SOX is its internal control reporting requirements, which allow inves- 
tors to be informed about the quality of a firm’s internal controls. Specifically, Section 302 of the 
` Act requires management to evaluate and report the effectiveness of disclosure and control рго- 
cedures (SEC 2002), while Section 404 requires the auditor to form an opinion on the effective- 
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ness of internal controls (SEC 2003). Using these internal control disclosures, studies have 
investigated whether the quality of internal controls affects accruals quality (Doyle et al. 2007a; 
Ashbaugh-Skaife et al. 2008), a firm’s cost of capital (Ogneva et al. 2007; Beneish et al. 2008; 
Ashbaugh-Skaife et al. 2009), and the accuracy of management guidance (Feng et al. 2009). Given 
the importance of the internal control provisions as a means to improve the governance of firms, 
our study extends the literature on the reporting effects of strong versus weak internal controls by 
examining how the quality of internal controls is related to conservatism in financial reporting. 

We focus on conservatism because it has been argued to provide several governance benefits, 
such as reducing agency conflicts and improving managerial investment decisions (Holthausen 
and Watts 2001; Watts 2003; Ball and Shivakumar 2005), enhancing the efficiency of debt con- 
tracts (Ahmed et al. 2002; Zhang 2008), and reducing litigation costs (Watts 2003). Following 
Basu (1997), we define conservatism as the higher degree of verification to recognize good news 
as gains than to recognize bad news as losses. That is, earnings reflect bad news more quickly than 
good news leading to timelier loss recognition. The literature generally refers to this as conditional 
conservatism (Beaver and Ryan 2005; Ball and Shivakumar 2005), which we simply refer to as 
“conservatism” in this study.” 

Within the agency framework of positive accounting theory that recognizes the contracting 
benefits of conservatism, it is plausible to expect that strong internal controls facilitate conserva- 
tive financial reporting. First, firms with a strong internal control environment (e.g., strong tone- 
at-the-top and/or good internal control culture) are more likely to understand the role of conser- 
vatism in contracting and in reducing agency conflicts. Consequently, these firms are more likely 
to favor its implementation. Further, to the extent that firms are committed to produce conservative 
reports, strong internal controls can facilitate this process by providing reliable accounting infor- 
mation. Weak internal controls can cause estimation errors in accounting numbers (Doyle et al. 
2007а: Ashbaugh-Skaife et al. 2008), making the contractible variables less reliable for contract- 
ing and monitoring purposes. This can reduce the effectiveness of conservatism as a governance 
mechanism and, hence, the incentives to implement conservatism. Finally, weak internal controls 
could impede the timely recognition of losses, thereby leading to lower conservatism. We develop 
these arguments on the positive relation between internal control quality and conservatism more 
fully in Section II. 

Moving away from the agency framework of positive accounting theory that argues the 
contracting benefits of conservatism, it becomes less clear why one would expect a positive 
relation between internal control quality and conservatism. First, it has been argued that conser- 
vatism can increase earnings management (Levitt 1998; Penman and Zhang 2002), which can 
exacerbate agency conflicts. Under this view, if strong internal controls serve as an effective 
monitoring mechanism that can mitigate agency problems (Jensen 1993), then we should instead 
expect strong internal controls to provide a disincentive for firms to produce conservative reports. 
Second, it is controversial whether conservatism is a desirable attribute of financial reporting. 
Barth (2008) argues that conservatism is not a qualitative characteristic under the conceptual 
framework of IASB, which specifies that accounting information should be unbiased. Conserva- 
tism, by promoting the deliberate understatement of book value and/or earnings, would render 


! In Endnote 59 of the SEC's (2002) release on the Final Rule: Certification of Disclosure in Companies’ Quarterly and 
Annual Reports, internal control is defined as "a process, effected by an entity's board of directors, management and 
other personnel, designed to provide reasonable assurance regarding the reliability of financial reporting." 

Another form of reporting conservatism, referred to as "unconditional conservatism," manifests itself in a systematic 
undervaluation of the firm's net assets. Ball and Shivakumar (2005) argue that unconditional conservatism cannot yield 
contracting efficiency because it can be observed and, hence, can be "undone" by users of the financial statements. 
Accordingly, we do not consider unconditional conservatism in our context. 
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financial statements not neutral. Under this investor value relevance framework that emphasizes 
that accounting numbers should be neutral inputs to investment decisions, the argument that strong 
internal controls would promote conservative accounting seems tenuous. 

Given the above arguments and corresponding tension, we empirically test the relation be- 
tween internal control quality and conservatism using firms’ internal control disclosures under 
SOX. We obtain a sample of firms that disclosed at least one material weakness (MW) from 
January 2003 to November 2005 and deem these firms (hereafter referred to as “MW firms”) to 
have low internal control quality due to the presence ої MWs.’ We apply three measures of 
conservatism that are commonly used in the literature to capture the asymmetric timeliness in the 
recognition of economic losses: (1) the persistence of earnings changes measure in Basu (1997), 
(2) Фе accrual-based loss recognition measure in Ball and Shivakumar (2005), and (3) the time- 
liness of earnings to news measure in Basu (1997). To afford stronger inferences to be made about 
the effects of internal controls on conservatism, we further conduct intertemporal tests of changes 
in the status of internal controls. Specifically, we examine whether MW firms that subsequently 
remediate their weaknesses exhibit different levels of conservatism from MW firms that continue 
to have these weaknesses. 

Results using all three measures of conservatism are consistent with a positive relation be- 
tween internal control quality and conservatism. Specifically, we find that MW firms exhibit lower 
conservatism than control firms without such weaknesses. Further, we find that MW firms that 
subsequently remediate their weaknesses (і.е., show an improvement in internal control quality) 
exhibit greater conservatism than MW firms that continue to have these weaknesses. These results 
are robust to a battery of checks, which include controlling for the self-selection bias in the MW 
firms sample, controlling for the self-selection bias in the improvement of internal control quality, 
and using alternative conservatism measures suggested by Khan and Watts (2009) and Givoly and 
Hayn (2000). Taken together, our results are consistent with strong internal controls serving as a 
mechanism that facilitates conservatism. 

Doyle et al. (2007a) and Ashbaugh-Skaife et al. (2008) find a positive relation between 
internal control quality and accruals quality, which is measured by the extent to which accruals are 
realized as cash flows or the size of abnormal accruals. Our study differs from their studies by 
examining conservatism, which is a form of accounting bias toward reporting low book values of 
stockholder equity, conditional on firms experiencing contemporaneous economic losses (Ball and 
Shivakumar 2005).* In both Doyle et al. (20074) and Ashbaugh-Skaife et al. (2008), the authors 
conjecture that their findings could be due to weak internal controls (1) causing unintentional 
(poor estimation ability) errors, or (2) failing to limit intentional income-increasing earnings 
management. However, Ashbaugh-Skaife et al. (2008) find that firms that report internal control 
weaknesses have significantly larger positive and larger negative abnormal accruals relative to 


* According to Auditing Standards No. 2 (PCAOB 2004), which was in effect during the time period of our study, an MW 
is "a significant deficiency, ос combination of significant deficiencies, that results in more than a remote likelihood that 
a material misstatement of the financial statements will not be prevented or detected." A significant deficiency is "a 
control deficiency, or combination of control deficiencies, that adversely affects the company's ability to initiate, 
authorize, record, process, or report external financial data reliably in accordance with generally accepted accounting 
principles such that there is more than a remote likelihood that a misstatement of the company's annual or interim 
financial statements that is more than inconsequential will not be prevented or detected." 

Ball and Shivakumar (2006) argue that conventional accruals models assume that the relation between accruals and cash 
flows are linear and do not take into account of the asymmetry in the gain and loss recognition role of accruals. Hence, 
these models misspecify the accounting accrual process and misestimate discretionary accruals, resulting in incorrect 
inferences about earnings quality. By examining the timely loss recognition property of earnings, our study takes into 
consideration the gain and loss recognition role of accruals and the nonlinearity implied by the asymmetry therein. This 
addresses the concerns in the use of the linear accrual models. 
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control firms, which suggests that the first factor is more plausible. Nonetheless, these two factors 
need not necessarily result in lower conservatism per se. 

First, the unintentional estimation errors caused by weak internal controls should affect the 
recognition of bad news and good news in an unbiased and symmetrical manner. Accordingly, 
these estimation errors are not likely to cause conservatism per se, which results from the asym- 
metric response of earnings to bad versus good economic news. Second, even if weak internal 
controls fail to limit earnings management, the presence of income-increasing earnings manage- 
ment is not necessarily consistent with the presence of lower conservatism. Givoly et al. (2010, 
221) argue that earnings management is situational or episodic, occurring when unmanaged earn- 
ings fail to meet an important reporting objective (e.g., meeting analysts’ forecasts). Further, 
earnings management tends to be small-scale in nature due to the short supply of the needed 
positive accruals. Hence, if the company’s accounting is generally conservative, earnings manage- 
ment will only temporarily interrupt this observed reporting pattern but will not obscure the 
presence of the more prevalent phenomenon of conservatism. The implication is that earnings 
management and conservatism can co-exist within a firm? Hence, our study complements Doyle 
et al. (20078) and Ashbaugh-Skaife et al. (2008) by providing insights into how internal control 
quality is associated with the phenomenon of conservatism. 

Overall, our study extends the literature on the reporting effects of strong versus weak internal 
controls (e.g., Doyle et al. 2007a; Ogneva et al. 2007; Ashbaugh-Skaife et al. 2008; Beneish et al. 
2008). We show that strong internal controls can create an incentive for and facilitate conservative 
reporting. Our study also contributes to the literature that shows that conservatism is shaped, in 
part, by a firm's contracting and governance environment (e.g., Ball et al. 2000; Basu et al. 2001; 
Ball et al. 2003; Ball and Shivakumar 2005; Bushman and Piotroski 2006; Ahmed and Duellman 
2007; LaFond and Roychowdhury 2008). 

Section II develops the hypotheses, Section III explains the research design, Section IV 
presents the empirical results, Section V describes additional analyses, and Section VI concludes. 


П. HYPOTHESIS DEVELOPMENT 

In an attempt to restore investor confidence in firms' financial reporting, SOX requires firms 
to assess and disclose, and auditors to certify, the effectiveness of internal controls over financial 
reporting (SEC 2002, 2003). Presumably, regulators hope that these requirements can improve the 
quality of internal controls and enhance the reliability of financial reporting. Indeed, studies have 
shown that effective internal controls can enhance financial reporting quality, proxied by accruals 
quality and the size of abnormal accruals (Doyle et al. 2007a; Ashbaugh-Skaife et al. 2008). 
Further, Beneish et al. (2008) and Ashbaugh-Skaife et al. (2009) find that effective internal con- 
trols result in reduced information risk, which can lower a firm's cost of equity. However, Ogneva 
et al. (2007) fail to find a significant relation between internal control weaknesses and cost of 
equity after controlling for primitive firm characteristics and analyst forecast bias. Finally, Feng et 
al. (2009) find a positive relation between internal control quality and the accuracy of management 
guidance, consistent with ineffective internal controls causing errors in internal management re- 
ports. Our study extends this stream of research by examining how the quality of internal controls 
affects conservatism in financial reporting. 

Conservatism has been argued to play an important governance role in mitigating agency 
conflicts and enhancing contracting efficiency within thé firm (Holthausen and Watts 2001; Watts 





u 


Consistent with this line of reasoning, Givoly et al. (2010) find that public equity firms report more conservatively than 
their private equity counterparts, but at the same time, public equity firms have a greater propensity to manage income. 
The authors also conduct two tests to show that earnings management and conservative reporting may independently 
exist in their data. 
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2003; Ball and Shivakumar 2005). The reasoning is that conservatism reduces managers’ ability 
and incentives to overstate earnings and net assets by requiring higher verification standards for 
gain recognition and reduces managers’ ability to withhold information on expected losses. Thus, 
it prevents overcompensation of managers that is costly to recover ex post because of managers’ 
limited liability and tenure. Similarly, in a debt-contracting setting, conservatism reduces man- 
agers' ability to loosen or avoid dividend restrictions and transfer wealth from bondholders to 
shareholders, thereby mitigating deadweight losses and increasing firm value (Ahmed et al. 2002; 
Zhang 2008). According to Ball and Shivakumar (2005), conservative accounting effectively 
limits the control rights of managers and transfers those rights back to the providers of finance 
earlier. Finally, Watts (2003) asserts that conservatism can also reduce a firm's expected litigation 
costs. 

Existing studies suggest that the incentives for or against conservative reporting are shaped, in 
part, by a firm's contracting and governance environment (e.g., Ball et al. 2000; Basu et al. 2001; 
Ball 2001; Ball et al. 2003; Ball and Shivakumar 2005; Bushman and Piotroski 2006; Ahmed and 
Duellman 2007; LaFond and Roychowdhury 2008; LaFond and Watts 2008; Nikolaev 2010). For 
instance, Ahmed et al. (2002) argue that conservatism evolves as an efficient contracting mecha- 
nism to mitigate dividend policy conflicts between shareholders and bondholders. They find that 
the more severe such conflicts are (i.e., the higher the leverage), the more conservative the firm's 
accounting choices become. LaFond and Roychowdhury (2008) hypothesize and find that when 
managerial ownership is low, the severity of the agency problem increases, driving a demand for 
greater conservatism. Ball and Shivakumar (2005) find that private firms are less conservative than 
public firms and attribute this finding partly to the differences in governance structures and moni- 
toring mechanisms between these two types of firms. Given that the internal control system plays 
an important governance role within the firm to monitor managers' behavior and to minimize 
agency costs (Fama and Jensen 1983; Jensen 1993), we conjecture a potential link between the 
quality of internal controls and conservative reporting. 

Within the agency framework of positive accounting theory that recognizes the contracting 
benefits of conservatism, strong internal controls could therefore create an incentive for conser- 
vative accounting reports. According to the Internal Control—Integrated Framework (Committee 
of Sponsoring Organizations of the Treadway Commission 1992), the internal control environment 
is an important component of internal controls. We argue that firms with a strong tone-at-the-top 
(e.g., an effective board of directors that protects the interests of shareholders) and good internal 
control culture (e.g., a corporate culture that has a wealth-maximization orientation) are better able 
to understand the benefits of conservatism in reducing agency conflicts and deadweight loss, in 
contracting with shareholders and/or debtholders, and in reducing the litigation risks for directors, 
auditors, and managers. Consequently, these firms are more likely to favor the implementation of 
conservatism and promote an environment that emphasizes the adherence to conservative report- 
ing. 

To the extent that firms understand the benefits of conservatism and are committed to produce 
conservative reports, strong internal controls can facilitate the use of conservatism in the contract- 
ing process by providing reliable accounting information. This is because the financial reporting 
system, by emphasizing verifiable outcomes, supplies a rich set of variables that can be used for 
contracting purposes (Watts and Zimmerman 1986; Bushman and Smith 2001; Armstrong et al. 
2010). For instance, financial covenants used in debt contracts to limit managerial discretion with 


® Conservative accounting reduces the tendency of managers with short-term horizons to invest in negative Net Present 
Value (NPV) projects, making managers &ware that they will not be able to defer the recognition of losses to the future 
(Ball and Shivakumar 2005) and imposing greater costs to biasing financíal reports upward (Guay and Verrecchia 2006). 
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respect to dividend distributions (Ahmed et al. 2002) are often defined in terms of accounting 
numbers. Similarly, managers’ compensation can be tied to changes in book value (or earnings). 

As mentioned earlier, weak internal controls can cause unintentional errors in accrual estima- 
tions to occur and impact the reported book value or earnings. For instance, the failure to have 
appropriate reconciliations and reviews in place allow for procedural errors in accrual estimations 
(Doyle et al. 2007a), and the lack of adequate policies, training, or diligence by company employ- 
ees can affect the noise and magnitude of abnormal accruals (Ashbaugh-Skaife et al. 2008). 
Although these estimation errors cannot cause lower conservatism per se, they can make the 
contractible variables noisy and less reliable, providing a poor signal to the board of directors that 
conservatism has been used effectively in contracting and reducing the usefulness of conservatism 
in monitoring managers.’ In sum, by enhancing the ability to use accounting numbers effectively 
in contracts, strong internal controls can create an incentive for firms to use conservatism as a 
governance mechanism. 

Finally, weak internal controls could also impede the timely recognition of losses, thereby 
leading to lower conservatism. For instance, the lack of proper policies and procedures to regularly 
value a firm’s inventory or fixed assets prevents asset impairment from being discovered early and, 
hence, delays the recognition of losses. Unqualified accounting staff may lack the expertise to 
estimate the future cash flows of assets such as goodwill and fixed assets, and identify any 
reduction in future cash flows that would trigger early recognition of impairment losses. Weak 
internal controls in the monitoring of the firm’s investment decisions can also provide the incen- 
tive for managers to avoid timely loss recognition in investment projects. One example is the lack 
of a proper process to provide the board of directors with timely information on the firm’s 
investments for oversight and review, which could make it easier for managers to invest in or 
avoid divesting negative NPV projects. Further, control weaknesses over the computation of the 
present value of future profits of acquired businesses can result in potential loss-making projects 
not being discovered and terminated early. The Appendix provides some examples of MWs dis- 
closed by firms that could potentially impede the timely recognition of losses. 

The above arguments suggest a positive effect of conservative accounting reports in terms of 
contracting efficiency. However, moving away from the agency framework of conservatism as a 
beneficial governance mechanism, it becomes less clear why one would expect a positive link 
between internal controls and conservatism. First of all, it is questionable whether conservatism is 
beneficial to firms and their shareholders. For instance, Penman and Zhang (2002) contend that 
conservative accounting can create unrecorded reserves that provide managers with the flexibility 
to report more income in the future. The increased ability to manage earnings can adversely affect 
the predictive ability of current earnings for future earnings. In addition, three of the five financial 
reporting problems highlighted by former SEC Chair Arthur Levitt deal with the understatement of 
assets—big bath charges, creative acquisition accounting, and cookie jar reserves (Levitt 1998). If 
so, then conservative accounting reporters may actually have greater ability to smooth or manipu- 
late earnings, which could exacerbate, rather than mitigate, agency problems. If strong internal 
controls serve as an effective monitoring mechanism to reduce agency conflicts (Jensen 1993), 


~ 


One may question why a firm would not ensure that internal controls are effective or remediate internal control 
weaknesses promptly if the firm views strong internal controls as important in ensuring conservative reports. This could 
be due to several reasons. First, the firm may fail to constantly invest in internal controls to keep up with the growth of 
the firm. Second, managers may not be willing to invest in the time and resources to ensure effective internal controls 
because such actions divert attention away from the core business. Third, firms may face strained financial resources. 
Fourth, firms may have complex operations, such as multiple operating segments, making it difficult to implement 
effective internal controls. 
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then the claim that strong internal controls promote conservatism would become untenable. In- 
stead, strong internal controls could provide no incentive or even a disincentive for firms to 
produce conservative accounting reports. 

Further, regulators and standard-setters do not necessarily view conservatism as a desirable 
attribute of financial reporting. For instance, Barth (2008) points out that conservatism is not a 
qualitative characteristic of accounting information under the conceptual framework of the Inter- 
national Accounting Standards Board (IASB). The framework specifies that accounting informa- 
tion should be unbiased, and freedom from bias is an essential characteristic of reliability and of 
neutrality (IASB 2001, para. 31, 36). Conservatism, by allowing the deliberate understatement of 
assets or income and/or the deliberate overstatement of liabilities or expenses, renders financial 
statements not neutral and, therefore, jeopardizes the quality of reliability and unbiasedness de- 
sired by the conceptual framework. 

The Financial Accounting Standards Board (FASB) asserts that the best interest of users is 
served by neutral reporting accompanied by appropriate disclosure of "the nature and extent of the 
uncertainty surrounding events and transactions reported to stockholders and others" (see SFAC 
No. 2, FASB 1980, para. 93; FASB 2000, para. 96, 97). Consistent with this perspective, the 
FASB is shifting its focus toward fair value accounting, lessening the asymmetric treatment of bad 
and good news in financial statements. If the objective of financial reporting is to provide infor- 
mation useful for making economic decisions (IASB 2001, para. 12), then having strong internal 
controls that ensure that the financial reports are unbiased would satisfy investors' demand for 
neutral information. Hence, within this investor value-relevance framework, the argument that 
strong internal controls would promote conservative financial reports (which are essentially bi- 
ased) seems tenuous. 

Given these potentially conflicting perspectives, we empirically test the relation between 
internal controls and conservatism and test the null hypothesis that internal control quality is 
unrelated to conservatism. In order to further ascertain the nature of any relation between internal 
controls and conservatism, we perform intertemporal tests, as in Ashbaugh-Skaife et al. (2008), to 
determine whether firms with weak internal controls that subsequently show an improvement in 
internal controls exhibit different levels of conservatism from firms that fail to do so. This would 
help strengthen our finding on the relation between internal controls and conservatism. The null 
hypotheses we test are as follows: 


H1: There is no relation between internal control quality and conservatism. . 


H2: Firms with weak internal controls that subsequently show an improvement in internal 
controls (i.e., remediate their internal control weaknesses) exhibit no difference in con- 
servatism from firms that fail to do so. 


Ш. RESEARCH DESIGN 
Measures of Accounting Conservatism 


Persistence of Earnings Changes (Basu 1997) 
Our first measure to capture the differential timeliness of loss versus gain recognition is the 
persistence of earnings changes in Basu (1997). Basu (1997) shows that, relative to good news 


* For example, in SFAC No. 2 the FASB states, “Conservatism in financial reporting should no longer connote deliberate, 
consistent understatement of net assets and profits” (FASB 1980, para. 93). The FASB also states in the same paragraph 
that conservatism “became deeply ingrained and is still in evidence despite efforts over the past 40 years to change it.” 
In the joint exposure draft by the FASB and IASB, Conceptual Framework for Financial Reporting (FASB 2008, para. 
BC2.21), it states that "Ше boards concluded that describing prudence or conservatism as a qualitative characteristic or 
a desirable response to uncertainty would conflict with the quality of neurrality because ... an admonition to be prudent 
is likely to lead to a bias in the reported financial position and financial performance ... Accordingly, the proposed 
framework does not include prudence ot conservatism as desirable qualities of financial reporting information” (em- 
phasis in the original). 
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periods, conservatism results in lower persistence of earnings in bad news periods. The deferred 
recognition of relatively good news results in positive changes in income being less likely to 
reverse than negative earnings changes. This is because, from a time-series perspective, the bad 
news reflected in current earnings will appear as a transitory shock in the earnings process, 
whereas the effects of a current positive shock will be spread over the earnings of several future 
periods as anticipated gains are realized. The following model from Basu (1997) is used to 
estimate this relation: 


ANI, = Qo + РАМ + АМ, + азРАМГ, з ж АМ, + є (1) 


where firm i subscripts are omitted. АМ (АМ, 1) is the change in net income before extraordinary 
items for firm і in fiscal year t (1-1) deflated by beginning-of-year total assets, and DANI, i is an 
indicator variable that equals 1 if ANZ, , is less than 0, and 0 otherwise. 

The negative coefficient on DANI, | ж ANI, , is consistent with timely loss recognition. We 
further develop model (1) into model (2) to test H1, where MW is an indicator variable that equals 
1 if the firm has at least one MW in internal controls, and 0 otherwise. If MW firms report 
less (more) conservatively than firms without such weaknesses, then the coefficient on 
DANI, (х ANI, * MW will be positive (negative); that is, MW firms have a reduced (increased) 
tendency to reverse negative earnings changes in the following period. 


ANI, = @ + & DANI, + аз ANI, + азРАМТ, * ANI, + САМУ + a@sDANI,_, ж MW 
+ ag ANI, * MW + о ДРАМІ, * ANI, * MW + agMB,+ РАМ, * MB, 
+ ара МІ, ж МВ, + @ DANI, | ж ANI, ж МВ, + a LEV, аз РАМТ ж LEV, 
+ ад ANI, * LEV, + аз РАМ * ANI, * LEV, а16512Е, + оц2 АМ, * SIZE, 
+ аз МІ, * SIZE, + ао АМТ ж ANI, | ж SIZE, + ард АТ, + ас DANI, * LIT, 
+ а%АМ, 1 * LIT, + РАМ, -1 * ANI, * LIT, + о МО, + @25DANI,_; + IND; 
+ аб АМ, * IND) ар ДМІ, ж ANI, үж IND; + 8 (2) 


where firm i subscripts are omitted. 

Model (2) controls for firm characteristics that are used in prior research to proxy for the 
demand for conservatism. We include MB, the market value of equity divided by the book value 
of equity at the end of the fiscal year t, to control for the demand of conservatism arising from 
information asymmetry associated with a firm’s growth option (LaFond and Watts 2008), and 
because studies document a negative association between conditional and unconditional conser- 
vatism (Givoly et al. 2007; Roychowdhury and Watts 2007). Firms with high levels of leverage 
tend to have greater bondholder and shareholder conflicts that, in turn, increase the contractual 
demand for conservatism (Ahmed et al. 2002; Zhang 2008). Hence, we include LEV, the sum of 
long-term debt and current liabilities deflated by total assets at the end of the fiscal year t. 

LaFond and Watts (2008) argue that large firms produce more public information and have 
less information asymmetry, reducing the demand for conservative accounting. Hence, we control 
for firm size (SIZE) using the natural log of total assets at the end of the fiscal year 1.7 We also 
control for litigation risk because it can increase managers' incentive to recognize losses in a more. 
timely manner than gains (Basu 1997; Watts 2003). Following Francis et al. (1994), we define an 


? We mean-adjust all the continuous control variables in our regression models (1.е., MB, LEV, and SIZE) to reduce the 
problems with multicollinearity among the interaction terms (Neter et al. 1989; Aiken and West 1991). This procedure 
helps to make the magnitude of the coefficients more comparable to prior work. 
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indicator variable LIT that equals 1 if a firm operates in a litigious industry, and 0 otherwise. ? 


Finally, Givoly et al. (2007) contend that the degree of conservatism varies across different 
industries. Hence, we include industry indicators (IND,) following the industry classification in 
Frankel et al. (2002).!! 

H2 tests whether MW firms that remediate their MWs will exhibit different levels of conser- 
vatism from MW firms that continue to have these weaknesses. Following Ashbaugh-Skaife et al. 
(2008), we deem an unqualified SOX 404 opinion by the external auditor to objectively and 
unambiguously show that the firm has fully remediated its MWs. We test H2 using the sample of 
MW firms and replace MW with FIXED in model (2), where FIXED is an indicator variable that 
equals 1 if an MW firm receives an unqualified second SOX 404 report, and 0 otherwise. A 
negative (positive) coefficient on DANI, * ANI, | ж FIXED indicates that MW firms that reme- 
diate their weaknesses exhibit greater (lower) conservatism than MW firms that still have weak- 
nesses, 


Accrual-Based Loss Recognition (Ball and Shivakumar 2005) 

Ball and Shivakumar (2005) develop a model to describe the differential timeliness of gain 
and loss recognition that relies on the correlation between accruals and contemporaneous cash 
flows.' In their regression, as shown in model (3) below, the authors predict a positive coefficient 
on DCFO * CFO for accounting conservatism. 


ACCRUAL = уул y,DCFO + СЕО + СЕО ж CFO (3) 


where firm i and time 7 subscripts are omitted. ACCRUAL is net income before extraordinary items 
minus operating cash flows for firm i in fiscal year t deflated by beginning-of-year total assets. 
CFO is operating cash flows for firm i in fiscal year ¢ deflated by beginning-of-year total assets. 
DCFO is an indicator variable that equals 1 if CFO is less than 0, and 0 otherwise. To examine 
НІ, we develop model (4) as shown below. If MWs in internal controls result in lower (greater) 
accounting conservatism, then the accruals can less (more) effectively and timely reflect the future 
expectation of the negative change of cash flows; the coefficient on DCFO * CFO * MW is 
predicted to be negative (positive). 


ACCRUAL = yp + у DCFO + у,СЕО + ууРСЕО + CFO + y,MW + ysDCFO * MW 
+ ¥6CFO ж MW + 34DCFO ж CFO ж MW + MB + ърСЕО * MB 
+ уоСЕО * MB + y,,;DCFO ж CFO + MB + у У + уз ОСЕО * LEV 
+ УмСЕО * LEV + у РСЕО + CFO + LEV + y;6SIZE + y,7DCFO + SIZE 


10 Consistent with Francis et al. (1994), firms with primary SIC codes of 2833-2836 (biotechnology), 3570-3577 (com- 
puter equipment), 3600-3674 (electronics), 5200-5961 (retailing), and 7370-7374 (computer services) are considered to 
be operating in a litigious industry. 

Ч Following Frankel et al. (2002), industry membership is determined by the following SIC codes: agriculture (АСВ, 
0100-0999), mining and construction (MIN, 1000-1999, excluding 1300-1399), food (FOO, 2000-2111), textiles and 
printing/publishing (TEX, 2200-2799), chemicals (CHE, 2800-2824, 2840-2899), pharmaceuticals (PHA, 2830-2836), 
extractive (EXT, 2900-2999, 1300-1399), durable manufacturers (DUR, 3000-3999, excluding 3570-3579 and 3670- 
3679), transportation (TRA, 4000-4899), utilities (UTI, 4900-4999), retail (RET, 5000-5999), financial (FIN, 6000- 
6999), services (SER, 7000-8999, excluding 7370-7379) and computers (COM, 3570-3579, 3670-3679, 7370-7379). 

12 The authors assert a positive but asymmetric correlation between accruals and contemporaneous cash flows. This 
positive correlation arises because cash ‘low revisions in the current period tend to be positively correlated with the 
current revisions for expected future cash flows. Timely recognition of unrealized gains and losses is based on expected, 

' not realized, cash flows, and is therefore accomplished through accruals. Consequently, timely gain and loss recognition 
will produce a positive correlation between accruals and current period cash flows. This correlation is asymmetric 
because losses, under conservative reporting, are likely to be recognized on a timelier basis than gains. 
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+ ¥1gCFO * SIZE + уОСЕО ж CFO * SIZE + YLIT + СЕО * LIT 
+ YnCFO ж LIT + узі СЕО ж CFO * LIT + ум) + YosDCFO * IND; 
+ 6 СРО * IND; + yz DCFO ж CFO * IND, + = (4) 


where firm i and time г subscripts are omitted and all variables are as previously defined. We 
replace MW in model (4) with FIXED to test H2. If the remediation of MWs is associated with 
greater (lower) conservatism, then the coefficient on DCFO ж CFO ж FIXED is predicted to be 
positive (negative). 


Timeliness of Earnings to News (Basu 1997) 

Our third measure of conservatism is a firm’s timeliness of earnings to news (Basu 1997). As 
shown in model (5) below, the timeliness of earnings is inferred from the responsiveness of 
accounting income to the change in market values. Negative (positive) market-adjusted stock 
returns are used as proxies for bad (good) news. The asymmetric recognition of economic losses 
relative to gains is captured by a positive coefficient on DR * R. 


М = By + B.DR + BR + BDR* К+ е (5) 


where firm i and time t subscripts are omitted. МІ is the net income before extraordinary items for 
firm i in fiscal year t, deflated by the beginning-of-year market value, R is the market-adjusted 
stock return for firm i over the fiscal year 7, and DR is an indicator variable that equals 1 if R is less 
than 0, and 0 otherwise.” 

To test H1, we develop model (6). A negative (positive) coefficient on DR * R * MW indicates 
that MW firms exhibit lower (higher) conservatism than firms without such weaknesses; that is, 
the MW firms have lower (higher) incremental timeliness of earnings to bad news than to good 
news. To test H2, we replace MW with FIXED in model (6). A positive (negative) coefficient on 
DR * R * FIXED would suggest that MW firms that remediate their weaknesses exhibit greater 
(lower) conservatism than MW firms that fail to do so. 


М = Во + BIDR + BR + PDR * R + B4MW + B5DR * MW + ВК * MW + ВОК * В+ MW 
+ BMB + БРЕ * MB + Bj9R * MB + By, DR + R* MB + ВУ  Bj4DR * LEV 
+ Вик * LEV + В,,ОКкК * LEV + BygSIZE + ВОК * SIZE + Вик ж SIZE 
+ BigDR ж К * SIZE + Во МТ + B5,DR * LIT + ВК * LIT + ВОК * К + LIT + Bj4IND; 
+ BosDR + IND, + ВК + IND; + ВэуОК * К ж IND, + е (6) 


where firm i and time ғ subscripts are omitted and all variables are as previously defined. 
Givoly et al. (2007) point out the importance of controlling for the information or disclosure 
environment when using the Basu's (1997) timeliness measure. The control variables in our 


13 Dietrich et al. (2007) argue that the interpretation of model (5) is valid only when the market is efficient; in particular, 
if market returns cause earnings, and not the reverse. The authors also show that partitioning a regression sample with 
one of the regressors (R) may lead to biased inferences. However, Ryan (2006) notes that two well-known empirical 
results, the low R?s observed in contemporaneous returns-earnings regressions (Collins et al. 1997; Ely and Waymire 
1999; Francis and Schipper 1999) and a large literature showing that returns typically reflect information on a timelier 
basis than earnings, indicate that the concern mentioned by Dietrich et al. (2007) is likely to induce a very tiny bías in 
the estimation of conservatism. Ryan (2006) recommends measuring returns over the fiscal year to partially remove the 
impact of the annual earnings announcement over stock prices, which occurs approximately three months after closing. 
He also recommends the use of market-adjusted returns, defined as raw returns minus the value-weighted CRSP market 
return, to create the partitioning dummy variable DR in the Basu regression. Hence, to mitigate the concerns of Dietrich 
et al. (2007), we measure returns over the fiscal year and use market-adjusted returns instead of raw returns. 
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regressions partially address this issue.'* Nonetheless, to further mitigate this concern, we control 
for other factors that are likely to proxy for the amount or quality of information disclosed by a 
firm—the annual frequency of management forecasts, the number of days from management 
forecast to fiscal year end date, audit quality (proxied by audit firm size), analyst following, and 
analyst forecast accuracy. The interpretation of our results for Basu’s (1997) timeliness measure 
(untabulated) is not affected by the inclusion of these variables. 


Sample Selection 


Using the sample firms in Doyle et al. (2007a), we identify 1,098 firms that, under either SOX 
302 or SOX 404, disclose at least one MW from January 2003 to November 2005." We focus on 
firms that disclosed MWs because the reporting of MWs is mandatory, whereas the reporting of 
significant deficiencies and control deficiencies is not (Doyle et al. 2007а).19 We deem a firm to 
have weak internal controls (і.е., low internal control quality) so long as it has at least one MW. 

Table 1, Panel A summarizes the sample selection procedure for the descriptive statistics and 
for testing H1. We choose the fiscal years 2000 and 2001 to test НЇ because these years just 
precede the enactment of SOX and, hence, avoid any confounding effects due to SOX. Our 
assumption is that MWs exist within the firm even before their disclosures from January 2003 to 
November 2005 (Doyle et al. 2007a). Based on the sample firms from Doyle et al. (2007a), we 
identify 1,230 (6,835) firm-year observations for the MW firms (control firms with no MW 
disclosures from January 2003 to November 2005) that have available data in Compustat and 
CRSP for fiscal years 2000 and 2001. Then we remove 84 (434) outlier observations for all the 
continuous variables at the top and bottom 0.5 percent levels, leaving us with 1,146 (6,401) 
firm-year observations for the MW firms (control firms) sample." In total, we have 7,547 firm- 
year observations for the descriptive statistics and НІ tests. 

Table 1, Panel B summarizes the sample selection procedure for testing H2, our remediation 
test. We focus on the MW sample because remediation is not applicable to the control sample. 
Based on an initial 1,098 MW firms, we remove 473 firms that have no second SOX 404 opinions; 
that is, we are not sure whether the MWs are remediated. These firms are either non-accelerated 
filers that are not required to file SOX 404 opinions (289 firms) or they have terminated registra- 


М First, Givoly et al. (2007) contend that the timeliness measure is understated for firms where the information environ- 
ment is characterized by a smooth and frequent arrival of news relative to firms for which the news arrives less 
frequently. The information environment of larger firms is more likely to conform to the former characterization. In this 
aspect, controlling for firm size addresses this concern. Second, Givoly et al. (2007) caution that if a highly litigious 
environment prompts management to preempt bad news by disclosing it early, the Basu's timeliness measure will 
under-estimate the degree of conservatism. Our inclusion of LIT as control in the regression mitigates this concern. 
Third, Givoly et al. (2007) point out that Basu's (1997) measure of conservatism is likely to be negatively related to 
unconditional conservatism. Hence, we include market-to-book value for control of unconditional conservatism. Fourth, 
Givoly et al. (2007) highlight that the information environment can vary across industries, which will affect the 
timeliness measure. We address this concern through the inclusion of industry-dummy variables as controls. 

15 We thank Jeffrey Doyle, Weili Ge, and Sarah McVay for sharing the data The data can be found at: 
http://faculty.washington.edu/geweili/ICdata.html. 

16 Although both MWs and significant deficiencies are deficiencies in the design or operation of internal controls, signifi- 
cant deficiencies are less severe and are not required to be publicly disclosed under SOX 302 (SEC 2004). Hence, the 
disclosure of significant deficiencies is clearly voluntary. On the other hand, under SOX 302, if management identifies 
an MW in their controls, then they are precluded from reporting that the controls are effective and must disclose the 
identified MW. Hence, the disclosure of MWs is effectively mandatory. According to Doyle et al. (2007b), there is some 
ambiguity regarding whether SOX 302 certifications require the public disclosure of MWs and whether some firms 
might interpret the MW disclosure requirement under SOX 302 as voluntary. The authors’ conclusion, from reading 
most of the SEC guidance, is that most firms are treating the disclosure as mandatory. 

Because we use the same sample for the descriptive statistics and for all of the НІ tests (based on the three different 

conservatism measures), an outlier observation is removed from all the H1 tests, rather than from just the test to which 

it applies. 
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tion of their securities (184 firms). We also remove 109 firms with missing data in Compustat and 
CRSP. This procedure yields a final sample of 516 firms for the H2 tests, with 366 unqualified 
opinions and 150 adverse opinions. 


ГУ. RESULTS 

Descriptive Statistics 

Table 2, Panel A presents the descriptive statistics for the sample used to test H1, which is 
based on the firm-year observations for the fiscal years 2000 and 2001. Consistent with Doyle et 
al. (2007b) and Ashbaugh-Skaife et al. (2007), the table shows that MW firms are smaller, finan- 
cially weaker, and more leveraged than the control firms. The MW firms are also more likely to 
operate in a litigious industry, and have more negative change in net income in year t-1 and more 
negative accruals than the control firms. We do not observe any differences in the change in net 
income in year 4, stock returns, cash flow from operations, stock return, and market-to-book ratio 
between the MW and control firms. Table 2, Panel B presents both Pearson and Spearman corre- 
lation statistics; we do not find any unusual correlations among the independent variables in our 
regressions that warrant concern. 


Multivariate Regression Results 
Regression Results for H1 


Table 3 presents the regression results of НІ based on the persistence of earnings changes 
measure. We first run a baseline regression model without the MW and its related terms. This 
allows us to assess the incremental explanatory power of internal control quality on conservatism 
once the MW terms are included into the estimations. The baseline regression results, presented in 
Column 1, show that the coefficient on DANI,_, ж ANI, is —0.368 (р < 0.01), confirming that 
financial reporting is conservative in general. Column 2 shows that, after including the MW and 
the related terms, the adjusted В? of the regression increases from 24.0 percent to 24.6 percent. 
This result suggests that internal control quality provides incremental explanatory power to con- 
servatism in addition to factors affecting the demand for conservatism. Column 2 also reveals that 
the coefficient on DANI, * ANI, | ж MW is 0.217 (р = 0.01), which implies that the MW firms 
have a reduced tendency to reverse negative earnings changes in the following period (1.е., are less 
conservative) relative to the control firms. This result is consistent with a positive relation between 
internal control quality and conservatism. 

We repeat the above analyses using the accrual-based loss recognition measure and present 
the results in Table 4. Column 1 shows that the coefficient оп DCFO ж CFO is 0.669 (p < 0.01), 
indicating the presence of reporting conservatism. Column 2 shows that the addition of MW and its 
related terms to the baseline regression increases the adjusted R? from 16.7 percent to 17.0 
percent, suggesting that internal control quality provides incremental explanatory power to a firm's 
level of conservatism. The coefficient on DCFO * CFO * MW is —0.149 (p = 0.03), indicating 
that the MW firms accrue less unrealized losses in the cash-loss year; this result provides further 
support that the MW firms exhibit lower conservatism than the control firms. 

Finally, Table 5 presents the results of the same analyses using the timeliness of earnings to 
news measure. Column 1 shows that the coefficient on DR * R is 0.402 (p « 0.01), consistent with 
the presence of conservative reporting. When we add the MW and its related terms to the baseline 
regression, the adjusted R? of the model increases from 28.0 percent to 29.0 percent. Column 2 
further reveals that the coefficient on DR * R * MW is —0.072 (p — 0.06), which implies that the 
earnings of the MW firms reflect unexpected losses in a less timely manner than the control firms 
(1.е., the MW firms are less conservative). Again, this finding is consistent with the above results 
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TABLE 3 
Regression Results of H1 using the Persistence of Earnings Changes Measure 
Baseline Regression Full Regression 
Pred. (5 (2) 
Independent Variables Sign Coeff. p-value Coeff. p-value 
Intercept ? —0.042*** «0.01 —0.043*** <0.01 
DANI, | ? —0.026*** <0.01 —0.025*** <0.01 
ANI, 4 ? —0.145*** 0.01 —0.099* «0.01 
DANI, * ANI, | = —0.368++* <0.01 —0.412*** 001 
MW ? 0.007 0.26 
РАМІ, ж MW ? —0.009 0.36 
АМ, (9 MW ? —0.246*** <0.01 
DANI, , * ANI, * MW ? 0.217*** 0.01 
МВ, ? 0.005*** <0.01 0.005*** -001 
DANI, * MB, ? —0.00g «0.01 —0.008*** «001 
ANI, , * MB, ? —0.017*** <0.01 —0.016*** «001 
DANI, ж ANI; * MB, ? 0.024*** «001 0.023*** «001 
LEV, ? 0.086*** <0.01 0.085*** <0.01 
DANI, , + LEV, ? 0.006 0.47 0.007 0.39 
ANI, , * LEV, ? 0.395*** «001 0.406*** -001 
DANI, * ANI, , * LEV, & -0374*** | «001 —0.385*** <0.01 
SIZE, 7 0.005*** <0.01 0.004*** <0.0] 
РАМІ, * SIZE, ? 0.002 0.31 0.002 0.27 
ANI, | * SIZE, ? —0.096*** <0.01 —0.092*** <0.0] 
DANI, ,ANI, + SIZE, * 0.149*** — <0.01 0.144*** — <0.01 
LIT, ? —0.010 0.14 —0.010 0.16 
DANI, , * LIT, ? —0.045*** «0.01 —0.046*** <0.01 
ANI, * LIT, ? 202759 0.02 —0.187*** 001 
РАМІ, * ANI, , * LIT, = 0.137 0.15 0.147 0.13 
Industry Effects Not reported Not reported 
Adjusted R? 24.0% 24.6% 
Number of observations 7,547 7,547 


Ж, *+, +** Denote significance at the 10 percent, 5 percent, and 1 percent levels, respectively, based on two-tailed tests. 
This table reports the regression results of H1 using the persistence of earnings changes measure of conservatism in Basu 
(1997). Column 1 (Column 2) shows tke results without (with) the MW and its related terms. The dependent variable is 
АМТ, which is the change in net income before extraordinary items for firm i in fiscal year t deflated by beginning-of-year 
total assets. MW is an indicator variable that equals 1 if the firm has at least one MW in internal controls, and 0 otherwise. 
The other variables are as previously defined. A positive (negative) coefficient on DANI, * АМ, , ж MW indicates that 
MW firms exhibit lower (greater) conservatism than the contro] firms. 


of a positive relation between internal control quality and conservatism. З 


18 Prior research shows that conservatism is associated with governance factors such as managerial ownership (LaFond 
and Roychowdhury 2008) and board of director characteristics (Ahmed and Duellman 2007). Hence, we also examine 
whether internal control quality has an incremental effect on conservatism after controlling for these factors. We repeat 
the НІ tests in Tables 3-5 by further adding CEO share ownership, institutional ownership, the duality of the CEO and 
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Regression Results of НІ Using the Accrual-Based Loss Recognition Measure 


Independent Variables 


Intercept 

DCFO 

CFO 

DCFO * CFO 

MW 

DCFO » MW 

CFO + MW 

DCFO * CFO * MW 
MB 

DCFO * MB 

CFO « MB 

DCFO * CFO * MB 
LEV 

DCFO * LEV 

CFO * LEV 

DCFO * CFO * LEV 
SIZE 

DCFO * SIZE 

CFO * SIZE 

DCFO * CFO * SIZE 
LIT 

DCFO * МТ 

CFO * LIT 

DCFO * CFO * LIT 
Industry Effects 
Adjusted R? 


Number of observations 


Pred. 
Sign 


+ -э -э -э | ---0-4 - >>> <з зш 3 3 яз +5 2 +5 + | -э-2 


991 
TABLE 4 
Baseline Regression Fall Regression 
(D OL. Мэнэн 
Coeff. p-value Coeff. p-value 
—0.020*** 0.01 —0.014* 0.09 
0.002 0.86 —0.026* 0.10 
—0.502*** <0.01 —0.534*** — —0,01 
0.669 ** «0.01 0.643*** — «001 
—0.013* 0.07 
—0.006 0.64 
0.048 0.34 
—0.149** 0.03 
0,003*** <0.01 0.003*** <0.01 
—0.004*** <0.01 —0.004*** 0.01 
0.000 1.00 0.000 0.95 
0.000 0.98 0.000 0.93 
—0.023 0.11 —0.021 0.15 
0.004 0.88 —0.006 0.82 
—0.232** 0.03 —0.241** 0.03 
0.455*** <0.01 0.438*** <0.01 
0.003** 0.03 0.003** 0.03 
0.000 0.97 0.000 0.97 
0.006 0.56 0.005 0.59 
—0.070*** <0.01 —0.070*** — «0.01 
—0.009 0.40 —0.008 0.46 
—0.024 0.16 —0.023 0.17 
0.046 0.58 0.041 0.62 
0.083 0.42 0.104 0.31 
Not reported Not reported 
16.7% 17.0% 
7,547 1,547 


Ж ++, +++ Denote significance at the 10 percent, 5 percent, and 1 percent levels, respectively, based on two-tailed tests. 
This table reports the regression results of H1 using the accrual-based loss recognition measure of conservatism in Ball and 
Shivakumar (2005). Column 1 (Column 2) shows the results without (with) the MW and its related terms. The dependent 
variable is ACCRUAL, which is net income before extraordinary items minus operating cash flows for firm і in fiscal year 
t deflated by beginning-of-year total assets. The other variables are as previcusly defined. A negative (positive) coefficient 
on DCFO ж CFO + MW indicates that MW firms exhibit lower (greater) conservatism than the control firms. 


Our findings on the control variables are generally consistent with prior research. For in- 
stance, the coefficients on DANI, , * ANI, * МВ and DR ж К ж MB confirm the negative relation 


Chairman positions, board size, board independence, the share ownership of the independent directors, and the average 
outside directorships of the independent directors. The sample size for these tests is 1,920, which is much smaller than 
that used in the analyses in Tables 3—5. This is because we obtain the data on managerial ownership from the Execu- 
Comp database and the governance characteristics from the RiskMetrics database. Both databases provide information 
on S&P 1500 companies only. Our inferences on the results for H1 (untabulated) remain unchanged. Specifically, the 
coefficients on DANI, , є АМ, * MW, DCFO ж CFO * MW, and DR * R * MW are 0.423 (р = 0.06), —0.510 (р = 


0.03), and —0.089 (p = 0.07), respectively. 
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TABLE 5 
Regression Results of H1 using the Timeliness of Earnings to News Measure 
Baseline Regression Full Regression 
(1) (2) 
Independent Variables Sign Coeff. p-value Coeff. p-value 
Intercept ? —0.011* 0.07 —0.009 0.13 
рк 2 0.049*** <0.01 0.052*** <0.01 
R ? 0.011** 0.05 0.013** 0.03 
DR*R + 0.402*** <0.01 0.414*** «001 
MW 2 -0.010 0.39 
DR * MW 2 -0.017 0.41 
R* MW 2 ~0.011 0.29 
DR*R* MW ? —0.072* 0.06 
МВ 2 0.001 0.37 0.001 0.37 
DR * MB ? —0.004** 0.05 —0.004* 0.06 
В * MB 2 —0.001 0.23 —0.001 0.22 
DR + В * MB 2 —0.020*** <0.01 —0.020*** 001 
LEV ? —0.037 0.12 —0.033 0.15 
DR * LEV 2 0.018 0.67 0.017 0.69 
R * LEV ? 0.032 0.19 0.030 0.22 
DR*R* LEV + 0.372**** «0.01 0.378*** <0.01 
SIZE ? 0.010 **:* «0.01 0.009*** <0.01 
DR * SIZE ? —0.005 0.21 —0.004 0.23 
В * SIZE ? —0.001 0.84 -0.001 0.79 
DR + К * SIZE - —0.077*** «0.01 —0.076*** «001 
LIT ? —0.063*** <0.01 —0.062***  <0.01 
DR * LIT 2 0.011 0.57 0.012 0.55 
R * LIT ? 0.013 0.22 0.014 0.19 
DR * R * LIT + —0.034 0.32 —0.033 0.34 
Industry Effects Not reported Not reported 
Adjusted R? 28.0% 29.0% 
Number of observations 7,547 7,547 


+, ЖЖ, ЖЕЖ Denote significance at the 10 percent, 5 percent, and 1 percent levels, respectively, based on two-tailed tests. 
This table reports the regression results of H1 using the timeliness of earnings to news measure of conservatism in Basu 
(1997). Column 1 (Column 2) shows the results without (with) the MW and its related terms. The dependent variable is МІ, 
which is the net income before extraordinary items for firm і in fiscal year t deflated by the beginning-of-year market value 
of equity. The other variables are as previously defined. A negative (positive) coefficient on DR * R * MW indicates that 
MW firms exhibit lower (greater) conservatism than the control firms. 


between conservatism and unconditional conservatism documented in other studies.’ The coeffi- 
cients on DANI, АМ, | * LEV, DCFO ж СЕО + LEV, and DR + R + LEV are also consistent 
with prior research that shows that more highly leveraged firms exhibit more conservatism. The 


19 We also use another measure of unconditional conservatism devised by Penman and Zhang (2002), which is the amount 
of a firm’s “hidden” reserves estimated using the sum of the value of the LIFO reserve, the research and development 
reserve, and the advertising reserve. Using this alternative measure, we continue to find a significant and positive 
relation between internal control quality and conservatism, and a significant and negative relation between unconditional 
conservatism and our conservatism measures. 
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coefficients on DANI, , ж ANI, | ж SIZE, DCFO + CFO ж SIZE, and DR * К * SIZE suggest that 
larger firms have lower conservatism, consistent with the aggregation effect discussed in Givoly et 
al. (2007) and with the information asymmetry hypothesis discussed in LaFond and Watts (2008). 
Finally, similar to LaFond and Roychowdhury (2008), we do not find a significant relation be- 
tween the level of litigation risks and conservatism. 


Regression Results for H2 

Table 6 reports the regression results for testing H2 using the persistence of earnings changes 
measure (Column 1), accrual-based loss recognition measure (Column 2), and timeliness of earn- 
ings to news measure (Column 3). For brevity, we present only the coefficients of the three-way 
interaction terms for the control variables. Column 1 reveals that the coefficient on 
DANI, з * ANI, , ж FIXED is —0.622 (р = 0.03). This result shows that MW firms that remediate 

















TABLE 6 
Regression Results of H2 
Persistence Accrual-Based Timeliness 
of Earnings А Loss of Биш ings 
ecognition to News 
A=DANI,; A = DCFO А = DR 
B=ANI,, В = CFO B=R 
Pred. о шиш Хас 
Sign Coeff. p-value Coeff. p-value Coeff. p-value 
Intercept 2 —0.004 0.85 —0.020 0.28 —0.017 0.56 
A 1 —0.004 0.91 —0.111*** «001 —0.003 0.95 
В 7 —0.553*** 001 —0.803*** «001 0.046 0.39 
АВ -,+,+ 0.160 0.54 0.436* 0.10 0.281*** «10.01 
FIXED ? 0.014 0.59 0.016 0.40 0.039 0.19 
А * FIXED ? —0.018 0.63 0.097*** —0,01 0.069 0.11 
B * FIXED ? 0.240 0.29 0.205 0.17 — 0.002 0.97 
A + В * FIXED ? —0.622** 0.03 0.748*** <0.01 0.186* 0.07 
А + B* MB ? 0.050 0.15 —0.041*** 0.01 0.013 0.16 
A*B*LEV – +, + 2.082** 0.05 0.397 0.38 0.391* 0.07 
A + B * SIZE хээл 0.021 0.85 —0.201*** 001 0.036 0.24 
А+ Вж LIT – +, + —0.700** 0.05 -0.393 0.17 —0.241*** 004 
Industry Effects Not reported Not reported Not reported 
Adjusted R? 30.7% 43.1% 33.696 ` 
Number of 516 516 516 
observations 


*, эж, *** Denote significance at ће 10 percent, 5 percent, and 1 percent levels, respectively, based on two-tailed tests. 
This table reports the regression results of H2 using the persistence of earnings changes measure (Column 1), accrual-based 
loss recognition measure (Column 2), and timeliness of earnings to news measure (Column 3). The dependent variables for 
Columns 1, 2, and 3 аге ANT, ACCRUAL, and NI, respectively. FIXED is an indicator variable that equals 1 if an MW firm 
receives an unqualified second SOX 404 report, and 0 otherwise. All the other variables are as previously defined. For 
brevity, we present only the coefficients of the three-way interaction terms of the control variables. A negative (positive, 
positive) coefficient on DANI, * АМ, у ж FIXED (DCFO * CFO * FIXED, DR ж К ж FIXED) indicates that MW firms 
that remediate their weaknesses at the time of the second SOX 404 report exhibit greater conservatism than MW firms that 
fail to do so. A positive (negative, negative) coefficient on DANI, , * АМТ, * FIXED (DCFO ж CFO * FIXED, DR ж R » 
FIXED) indicates that MW firms that remediate their weaknesses at the time of the second SOX 404 report exhibit lower 
conservatism than MW firms that fail to do so. 
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their weaknesses at the time of the second SOX 404 report exhibit greater conservatism than MW 
firms that continue to have weaknesses at the time of the report. The results using the other two 
measures of conservatism are consistent with this finding. Specifically, the coefficient on DCFO * 
CFO * FIXED in Column 2 is 0.748 (p < 0.01), and the coefficient on DR * R * FIXED in 
Column 3 is 0.186 (p = 0.07). Hence, the regression results on H2 using all three measures of 
conservatism strengthen our earlier finding of a positive relation between internal control quality 
and conservatism. 6 

In summary, the empirical findings using all three measures of conservatism show that inter- 
nal control quality provides incremental explanatory power to a firm’s level of conservatism. We 
find consistent evidence of a positive relation between internal control quality and conservatism. 
Specifically, firms with MWs exhibit lower conservatism than firms without such weaknesses, and 
MW firms that subsequently remediate their weaknesses exhibit greater conservatism than MW 
firms that fail to do so. Taken together, our results show that effective internal controls act as a 
mechanism that facilitates conservatism, consistent with the agency framework of positive ac- 
counting theory that recognizes conservatism as a governance mechanism (Watts 2003; Ball and 
Shivakumar 2005). In the next section, we describe additional analyses used to test the robustness 
of these results. 


V. ADDITIONAL ANALYSES 
Self-Selection of Internal Control Quality (H1) 

Prior research suggests that MW firms are likely to be systematically different from other 
firms (Ashbaugh-Skaife et al. 2007; Doyle et al. 2007b). This could result in potential self- 
selection bias in our observed samples for our H1 tests. Although the tests on the change in the 
status of internal controls mitigate this concern, we further econometrically control for this bias 
using three methods. First, we use Heckman's (1979) two-stage procedure, in which the first stage 
comprises a probit regression of MW on its determinants. These determinants largely follow those 
of Doyle et al. (20075) and Ashbaugh-Skaife et al. (2007), and are detailed in Table 7.” From this 
regression that identifies the likelihood of a firm being “selected” as an MW firm, we calculate the 
inverse Mills ratio, LAMBDA (see Heckman 1979; Leuz and Verrecchia 2000), and include it in 
our main regressions to control for the likelihood of self-selection into the MW group. 

Second, we create a matched sample based on the predicted probabilities from the first-stage 
probit regression. This method, known as propensity score-matching (LaLonde 1986), creates a 
non-MW control sample with the same predicted probabilities of having an MW (i.e., it considers 
all predictive variables together to estimate the same propensity of having an MW). The propen- 
sity score-matching produces a matched sample of 850 control firm observations, resulting in a 
combined sample of 1,700 firm observations.”! Third, we include the determinants of MWs, as 
well as their two-way and three-way interactions with the parameters of the three conservatism 
measures, aS additional control variables in the main regressions presented earlier. This method 
controls for the possibility that the factors that produce the MWs could also be driving observed 
differences in conservatism. 

Table 7 presents the results of the above three tests for the persistence of earnings changes 
measure. For brevity, we present only the results of the three-way interaction terms for the control 
variables. Column 1 shows the second-stage regression results of the Heckman two-stage proce- 





29 The first-stage regression results (untabulated) show that financial health (LOSS), rapid growth (EXTR_SALES), audit 
quality (BIGN), the occurrence of a restatement (RESTATE), and the occurrence of an auditor change (AUDCHANGE) 
are significantly associated with the presence of MWs. Details of the results are available from the authors upon request. 

21 The search algorithm finds the propensity scores closest to our MW firm observations until the first decimal place. The 
algorithm is available upon request. 
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dure after controlling for LAMBDA. Column 2 shows the propensity score-matching results and 
Column 3 shows the results after controlling for the determinants of МУУ. In all three regressions, 
we find that the coefficient on DANI, ,*ANI, ,*MW remains positive and significant, confirming 
our earlier findings in Table 3. When we repeat these analyses for the other two measures of 
conservatism, the results for all the three tests (untabulated) show that the coefficients on DCFO 
+ CFO * MW and DR * R ж MW remain negative and significant (р < 0.10). Taken together, we 
interpret these findings as suggesting that self-selection issues are not likely to drive the results of 
Н1 that we obtained earlier. 


Self-Selection of Improvement in Internal Control Quality (Н2) 


It is also possible that MW firms that choose to improve their internal controls can introduce 
a possible self-selection bias into our tests of H2. To control for this potential bias, we use the 
Heckman two-stage procedure and estimate a probit regression of FIXED on the same determi- 
nants of MWs as above, reasoning that the characteristics that lead to the incidence of MWs are 
also likely to cause difficulties for these firms to remediate MWs. We also include governance 
characteristics because firms with better governance are more likely to remediate MWs (Goh 
2009) 3 We hand-collect data on the board and audit committee characteristics from the proxy 
statements of the 516 MW firms used earlier to test H2, and are left with 437 firms with available 
data. 

Table 8 reports the results after including the inverse Mills ratio, LAMBDA, in our previous 
regressions used to test H2. For brevity, we omit the results for the control variables. The results 
in Table 8 are similar to those when we do not control for self-selection. Specifically, the coeffi- 
cients on DANI, , ж АМ, * FIXED, ОСЕО ж CFO * FIXED, and DR + R ж FIXED remain 
significant and have the same signs as those in Table 6. We continue to find similar significant 
results (untabulated) when we include the determinants of FIXED, instead of LAMBDA as addi- 
tional control variables in the regressions. In sum, we conclude that the results for H2 are unlikely 
to be driven by self-selection in the improvement in internal control quality. 


Alternative Measures of Conditional Conservatism 


Khan and Watts (2009) suggest a firm-specific measure of conservatism (C_SCORE) that is 
based on Basu’s (1997) measure of asymmetric timeliness of earnings. We follow their approach 
and derive the C_SCORE measure, with higher C_SCORE indicating greater conservatism. The 
regression model and results are presented in Table 9.24 In summary, the results using the 
C_SCORE measure confirm our earlier findings. Specifically, the coefficient on MW in Column 1 


22 To save space, we do not tabulate the results. However, these results, as well as all other results that we do not tabulate, 
are available from the authors upon request. 

23 These governance characteristics include board size, board independence, board meeting frequency, audit committee 
size, audit committee accounting financial expertise, audit committee meeting frequency, the duality of the CEO and 
Chairman positions, board commitment (proxied by the number of outside directorships in public companies held by the 
board directors), and board diligence (proxied by the proportion of directors who fail to attend more than 75 percent of 
the board meetings). The first-stage regression results show that firm profitability, the occurrence of a restatement, board 
independence, and audit committee meeting frequency are significantly associated with the likelihood of the remediation 
of MWs. Details on the definitions of the governance characteristics and the first-stage regression results are available 
from the authors upon request. 

Khan and Watts (2009) show that firms with high uncertainty and long investment cycles have future gains that are less 
verifiable and are more susceptible to gaming, generating a higher contracting demand for conservatism. Hence, we 
control for information uncertainty using STD (defined as the standard deviation of monthly stock returns) and control 
for the length of the investment cycle using CYCLE (defined as the depreciation expenses deflated by lagged asset), We 
control for AGE (firm age in years) because younger firms tend to have larger information asymmetry that is associated 
with more conservatism. Finally, we include SPREAD (the bid-ask spread scaled by the midpoint of the spread) to proxy 
for information asymmetry that is positively related to conservatism (LaFond and Watts 2008). 


? 
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TABLE 8 


Regression Results of H2 after Controlling for Self-Selection of Remediation of MWs 
Using the Heckman Two-Stage Procedure 


Persistence of aoa угийн га 
Changes Recognition to News 
A=DANI,.; А = DCFO A=DR 
В=АЛТ, 1 В = СЕО В = 8 
РТ (1) (2) (3) 
Sign Coeff. p-value Coeff. p-value Coeff. p-value 
Intercept 2 0.004 0.88  —0.010 0.65 0.014 0.89 
A ? 0.000 099 -0.102*** «001 0.100 0.41 
B ? 0.522*** «001  —0.96*** «001 0.132 0.45 
АВ ~+,+ 0.230 037 04575 0.09 0.455 0.11 
FIXED ? 0.009 073 0.005 078 0.049 0.63 
АЖ FIXED ? —0.001 0.98 0.088*** <0.01 0.166 0.24 
В * FIXED ? 0.348 0.13 0.055 0.69 —0.031 0.87 
А + В * FIXED ? —0.905** 0.01 0.664*** <0.01 0.893** 0.01 
LAMBDA 2 – 0.021 035  —0.042*** 001  —0.144** 0.05 
Control Variables Not reported Not reported Not reported 
Industry Effects Not reported Not reported Not reported 
Adjusted gp? 24.696 49.4% 23.1% 
Number of observations 437 437 437 


Ж жж *** Denote significance at the 10 percent, 5 percent, and 1 percent levels, respectively, based on two-tailed tests. 
This table reports the results of the regressions that control for self-selection in the remediation of MWs. The results are 
from the second-stage regression of the Heckman (1979) two-stage procedure, after controlling for LAMBDA. LAMBDA is 
the inverse Mill ratio calculated from the first-stage regression in which we regress FIXED on the determinants of MW 
remediation. The determinants of MW remediation include the determinants of MWs and the governance characteristics 
detailed in footnote 23. For brevity, we do not report the results for the control variables. Columns 1, 2, and 3 present the 
results using the persistence of earnings changes measure, accrual-based loss recognition measure, and the timeliness of 
earnings to news measure, respectively. In Column 1, the dependent variable is АМТ, which is the change in net income 
before extraordinary items for firm і in fiscal year г deflated by beginning-of-year total assets. In Column 2, the dependent 
variable is ACCRUAL, which is net income before extraordinary items minus operating cash flows for firm і in fiscal year 
т deflated by beginning-of-year total assets. In Column 3, the dependent variable is МІ, which is the net income before 
extraordinary items for firm i in fiscal year t deflated by the beginning-of-year market value of equity. All the other 
variables are as previously defined. 


is —0.007 (р = 0.02), consistent with MW firms exhibiting lower levels of conservatism than the 
control firms. The coefficient on FIXED in Column 2 is 0.288 (p — 0.03), consistent with MW 
firms that remediate their weaknesses exhibiting greater conservatism than MW firms that con- 
tinue to have weaknesses. 

То further corroborate our findings, we use a second accrual-based measure of conservatism 
(CON-ACC) that is suggested by Givoly and Hayn (2000). This measure is computed as income 
before extraordinary items less cash flows from operations plus depreciation expense deflated by 
average total assets, averaged over a three-year period before year t.” The lower the CON-ACC, 


25 The intuition behind this measure is that conservative accounting results in persistently negative accruals. Averaging the 
accruals over a number of periods ensures that the effects of any temporary large accruals are mitigated, as accruals tend 
to reverse within a one- to three-year period (Richardson et al. 2005). For the H1 test, we measure CON-ACC over the 
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999 


Regression Results Using the Firm-Specific Conservatism Measure in 


Hi: . | 
C SCORE = Му MMW + №,512Е + A4LEV + ММВ + ASSTD + CYCLE + МАСЕ 
+ AgSPREAD + є. 
H2: 
C SCORE = № + A, FIXED + MSIZE + МУ + МИВ + ASSTD + CYCLE + МАСЕ 
+ AgSPREAD + є. 
Test of НІ Test of H2 
Pred (1) (2) 
. Sign | Coeff. | | p-vale Сет руш 
Intercept ? 0.484*** — 001 1.408* 0.07 
MW 2 —0.007** 0.02 
FIXED 2 0.288** 0.03 
SIZE - —0.067*** «001 —0.184 0.14 
LEV + 0.233*** <0.01 184758 Ж «001 
MB 2 —0.001** 0.02 —0.025** 0.03 
STD + 0.063*** 001 1.373 0.26 
CYCLE - —0.340*** 001 —6.569*** 001 
АСЕ - 0.000 0,64 -0.008 0.19 
SPREAD + —0.011** 0.02 2.086* ** 0.01 
Industry Effects Not reported Not reported 
Adjusted R2 73.8% 7.0% 
Number of observations 5836 - 500 


ж, **, *** Denote significance at the 10 percent, 5 percent, and 1 percent levels, respectively, based on two-tailed tests. 
This table reports the results of the regressions that examine Н1 and H2 using the firm-specific conservatism measure 
C_SCORE in Khan and Watts (2009). The dependent variable is C SCORE, with higher C SCORE indicating greater 
conservatism. The other variables are defined as follows or as previously defined. 


Variable Definitions: 
STD - standard deviation of monthly stock returns; 
CYCLE - depreciation expenses deflated by the beginning-of-year total assets, which is a decreasing measure of 
the length of the investment cycle; 
AGE = firm age, the number of years since the firm appears in CRSP database; and 
SPREAD = bid-ask spread scaled by the midpoint of the spread, obtained from CRSP. 


three-year period before each 2000 and 2001 firm-year observation. For the H2 test, we measure CON-ACC over the 

three-year period right after the year of the second SOX 404 report. In these tests, we control for firm size (SIZE), 

leverage (LEV), and litigation risks (147) as before. Consistent with Ahmed et al. (2002), we control for growth in sales 
(SALESGRO because it will affect accruals such as inventory and receivables, which in turn affect CON-ACC. We 
control for research and development plus advertising expenditures (RDADV) because they are likely to capture eco- 
nomic rents generated by assets-in-place, growth opportunities, and GAAP mandated conservatism (Ahmed 1994). 
Finally, we control for profitability (cro) 8s proxied by the operating cash flows deflated by beginning-of-year total 
assets, because Ahmed et al. (2002) argue that profitable firms tend to use more conservative accounting. 
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the more conservative the firm’s accounting. Table 10 presents the results using CON-ACC, which 
is consistent with all our earlier findings. Specifically, the coefficient on MW in Column 1 is 0.010 
(р = 0.05), implying that MW firms have larger operating accruals (i.e., lower conservatism) than 
the control firms. Further, the coefficient on FIXED in Column 2 is —0.005 (p = 0.08), which 
indicates that MW firms that remediate their weaknesses exhibit greater conservatism than MW 
firms that fail to do so. In sum, our earlier findings on H1 and H2 are robust to alternative 
conservatism measures suggested by Khan and Watts (2009) and Givoly and Hayn (2000). 


Expanded Sample for H2 Tests 


In our H2 tests in Table 6, we remove 473 firms without SOX 404 opinions; that is, firms that 
are either non-accelerated filers (289 firms) or have terminated the registration of their securities 


TABLE 10 
Regression Results Using the Accrual-Based Conservatism Measure in 
Givoly and Hayn (2000) 

НІ: 

СОМ-АСС = 0) + 0,MW + SIZE + ВЗАГЕЗСКОУТН + 04RDADV + 0:СЕО + OLEV 

+ 6,LIT + є. 

H2: 

CON-ACC = 6) + O,FIXED + 0,SIZE + 0,SALESGROWTH + 04RDADV + 0:СЕО 

+ OLEV + G;LIT + e. 
Test of H1 Test of H2 
Pred. а) (2) 
Sign Coeff. p-value Coeff. p-value 

Intercept ? —0.079*** <0.01 —0.021*** <0.01 
MW ? 0.010** 0.05 
FIXED ? —0.005* 0.08 
512Е + 0.007*** — «10.01 0.002** 0.03 
SALESGROWTH + —0.003 0.38 0.005* 0.10 
RDADV + —0.012*** <0.01 0.000 0.63 
CFO - 0.068*** «0.01 0.052*** «0.01 
LEV = -0.009 0.45 —0.010 0.18 
LIT - —0.029*** «001 —0.008** 0.04 
Industry Effects Not reported Not reported 
Adjusted R2 3.5% 10.4% 
Number of observations 7,634 407 





Ж, жж *** Denote significance at the 10 percent, 5 percent, and 1 percent levels, respectively, based on two-tailed tests. 
This table reports the results of the regressions that examine НІ and H2 using the accrual-based conservatism measure in 
Givoly and Hayn (2000). The dependent variable is CON-ACC, which is income before extraordinary items less cash flows 
from operations plus depreciation expense deflated by average total assets, and averaged over a three-year period. The 
lower the CON-ACC, the preater the conservatism. The other variables are defined as follows or as previously defined. 


Variable Definitions: 
SALESGROWTH = percentage of annual growth in total sales (Data12); and 
RDADV = research and development expenditures plus advertising expense divided by sales. 


The Accounting Review May 2011 
American Accounting Association 


Internal Controls and Conditional Conservatism 1001 


(184 firms). This procedure results in a loss of roughly half of our MWs observations. To recover 
some of these lost observations, we relax our requirements to determine the remediation of MWs 
by using the SOX 302 opinions for non-accelerated filers. For these non-accelerated filers, we rely 
on the SOX 302 opinions at the time of the second SOX 404 report that would have been issued 
if the firm had been an accelerated filer. This procedure yields larger sample sizes of 814, 800, and 
651 firms with available data from Compustat and/or CRSP for the persistence of earnings 
changes measure, accruals-based loss recognition measure, and timeliness of earnings to news 
measure, respectively." We then replicate the H2 tests in Table 6 using these expanded samples, 
and find that the inferences of our earlier results remain unchanged. Specifically, untabulated 
results show that the coefficients on DANI, , * ANI, ж FIXED, DCFO ж СЕО ж FIXED, and DR 
ж R ж FIXED are —1.393 (р < 0.01), 0.557 (p = 0.06), and 0.607 (р = 0.01), respectively. 


Contextual Analysis of the Nature of MWs 


Finally, to provide a stronger link between internal controls and conservatism, we further 
explore the nature of the MWs. Using the arguments found in Ball (2001). we expect firms with 
MWSs pertaining to investment and/or asset management related decisions to have a greater incen- 
tive to avoid timely loss recognition." In addition, following the arguments in Doyle et al. 
(2007a), we expect company-level MWs to be more severe and pervasive than other types of 

. MWs, and consequently to result in even lower conservatism.” Hence, we code firms with at least 
one of the above types of MWs as “severe MW firms” and test whether these firms exhibit lower 
conservatism than the other MW firms. To conduct the analyses, we replace MW with SEVERE 
in the regression models (2), (4), and (6), where SEVERE is an indicator variable that equals 1 
for severe MW firms, and 0 otherwise. Using the MW firms observations from fiscal years 2001 
and 2002, the regression results (untabulated) reveal that the coefficients on 
DANI, , * ANI, * SEVERE, DCFO ж CFO ж SEVERE, and DR + R + SEVERE are 0.677 (p < 
0.01), —0.069 (р = 0.57), and —0.127 (р = 0.07), respectively. Hence, the contextual analysis of 
the nature of MWs shows some evidence that severe MW firms exhibit lower conservatism than 
the other MW firms. This evidence provides further empirical support of a positive relation 
between internal control quality and conservatism. 


26 The significantly smaller sample size for the timeliness of earnings to news measure is due to the additional require- 
ments for returns data from CRSP. Unlike before, we do not require that all three tests using different measures of 
conservatism have the same sample size. In other words, we do not impose the same data limitations to all measures of 
conservatism. This allows us to obtain the maximum sample size for each measure of conservatism to conduct the 
additional analyses on H2. 

Ball (2001) states that both debt contracting and corporate governance give rise to the demand for accounting conser- 
vatism, because conservatism can incorporate into financial statements adverse information about future cash flows of 
investment that is observable only to managers. MWs in the investment and/or asset management-related decisions can 
impede this governance role of consezvatism. For instance, control weaknesses over the computation of the future cash 
flows of acquired investments and the lack of staff expertise in estimating future cash flows for asset impairment tests 
make it harder for the accounting system to incorporate into financial statements adverse information about future cash 
flows, which will lead to lower level of conservatism. The examples of MWs found in the Appendix are illustrative of 
these types of MWs. 

Following Doyle et al. (2007a), company-level MWs include override by senior management and ineffective control 
environment. These MWs are more severe and pervasive than account-specific MWs, which include inadequate internal 
controls for accounting for loss contirgencies, deficiencies in the documentation of a receivables securitization program, 
and inadequate internal controls over the application of new accounting principles or the application of existing ac- 
counting principles to new transactions. Consistent with Doyle et al. (2007a), we also consider firms that have at least 
three account-specific MWs to have company-level MWs. We would like to thank Jeffrey Doyle, Weili Ge, and Sarah 
McVay for sharing the data on the classification ої МУ. 
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VI. CONCLUSION 

Our results show that firms that disclose material weaknesses (MWs) exhibit lower condi- 
tional conservatism than firms without such weaknesses, and MW firms that remediate their 
weaknesses exhibit greater conservatism than MW firms that fail to do so. Collectively, our results 
are consistent with strong internal controls facilitating conservatism. These findings extend the 
literature on the reporting effects of strong versus weak internal controls. In addition, our study 
contributes to the broader literature that tests the extent to which conservatism is shaped by a 
firm’s contracting and governance environment. 

One limitation of our study is that for our tests of H2, we remove nearly half of the 1,098 MW 
firms that do not have SOX 404 opinions due to their non-accelerated filer status and due to 
termination of securities registration. It is unclear how the addition of these firms would affect the 
results of H2. Another limitation is that our findings cannot directly shed light on the current 
debate regarding the costs and benefits of SOX. Our study informs only of the value of internal 
control quality, but not of the effectiveness of the internal control reporting requirements under 
SOX. 


APPENDIX 
EXAMPLES OF MATERIAL WEAKNESSES THAT COULD IMPEDE THE TIMELY 
RECOGNITION OF LOSSES 
NEOMAGIC CORP (28 April, 2005) 


NeoMagic Corporation has a material weakness as of January 31, 2005 related to its process 
of evaluating long-lived assets for impairment, including insufficient controls over the review and 
documentation of key assumptions used in preparing forecasted cash flows used to support Neo- 
Magic Corporation’s impairment analyses. This material weakness resulted in an audit adjustment 
that was recorded by NeoMagic Corporation to recognize the impairment of intangible assets. 


ОРЕКТУ CORP (16 March, 2005) 


We did not have sufficient internal personnel and technical expertise to properly apply ac- 
counting principles to certain non-routine matters; in particular, we applied certain provisions of 
Statement of Financial Accounting Standards No. 142, Goodwill and Other Intangibles (SFAS No. 

` 142), in an incorrect manner when conducting our annual analysis for potential impairment of 
goodwill, which led to an error that was identified and subsequently corrected. 


CLEARONE COMMUNICATIONS INC (18 August, 2005) 

We have a material weakness related to the tracking and valuation of inventory, including 
controls to identify and properly account for obsolete inventory. Our accounting policies and 
practices over tracking and valuation of inventory, including controls to identify and properly 
account for slow-moving, obsolete inventory were inconsistent with GAAP. This material weak- 
ness resulted in errors in recording inventories at the lower of cost or market, and errors for 
inventory shrinkage. 

We have a material weakness in accounting for non-routine transactions, which include busi- 
ness combinations, discontinued operations, sale of a business unit (other than discontinued op- 
erations), and evaluation and recognition of impairment charges. This material weakness resulted 
in improper purchase price allocations in business combinations, improper amortization and de- 
preciation of long-lived assets, improper identification and recording of activities related to dis- 
continued operations, improper recording and reporting the sale of business units, improper evalu- 
ation of triggering events associated with impairment of long-lived assets (including annual 
impairment tests for goodwill); and improper calculating and recording of impairment charges. 
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ASCONI CORP (27 May 2004) 


We lacked controls to ensure that agreements, contracts, and other documents relating to 
investments in subsidiaries or investees were provided to and reviewed by accounting and finan- 
cial reporting personnel on a timely basis to ensure that the financial reporting and disclosure 
implications of such transactions could be considered and reflected in the financial statements in 
the proper periods. 


FINANCIAL INDUSTRIES CORP (10 June, 2003) 


Management identified a significant number of internal control weaknesses in several key 
areas that had resulted in material misstatement of financial results. These include weaknesses in 
the following areas, among others: deferred policy acquisition costs; present value of future profits 
of acquired businesses; investment accounting; consolidation process; purchase accounting; and 
intercompany accounting. | 
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L INTRODUCTION 
elative performance evaluation (RPE) entails the use of peer performance in evaluating an 
Ree performance. Based upon the premise that peer performance captures common 
exogenous shocks, the major theoretical benefit of RPE is to insulate an agent from 
common risk, while providing the principal with a more informative measure to assess the agent’s 
actions. Consequently, RPE is expected to lead to risk-sharing benefits in incentive contracting 
(e.g., Lazear and Rosen 1981; Holmstrom 1982; Nalebuff and Stiglitz 1983). 

Despite RPE’s theoretical appeal, prior empirical research offers mixed evidence on the use of 
RPE in executive compensation contracts based on an implicit approach.! The implicit approach 
infers RPE use by regressing executive pay on industry performance across a population of firms, 
and thus relies on simplified assumptions concerning RPE contract details (such as RPE peer- 
group composition, performance metrics used in RPE, and components of pay covered by RPE). 
These assumptions unavoidably introduce measurement errors into implicit tests (Murphy 1999; 
Bannister and Newman 2003). More importantly, studies that rely on the implicit approach are 
unable to examine the execution of RPE contracts (such as peer selection), which is essential to 
evaluating the costs and benefits of RPE as an incentive mechanism (Matsumura and Shin 2006; 
Bol et al. 2010). 

The SEC's 2006 executive compensation disclosure rules require firms to provide details on 
how performance targets (including relative performance targets) are used in setting executive pay. 
We exploit this mandatory disclosure regime to examine the explicit use of RPE and related peer 
groups. We first examine the extent to which simplified assumptions concerning RPE contract 
details affect inferences drawn from the traditional implicit approach. We then examine firms' 
decisions to explicitly use RPE in setting executive pay. Finally, we focus on RPE peer groups, an 
essential element of RPE contracts, to investigate factors that might influence the selection of 
peers for RPE purposes. 

Using S&P 1500 firms' first proxy disclosures under the SEC's new disclosure rules, we find 
that about 25 percent of S&P 1500 firms explicitly use RPE in setting executive compensation. 
When using the implicit approach (such as in Albuquerque (20091), where RPE peers are matched 
on both industry and size, we do not find evidence of RPE use in S&P 1500 firms in our 2006 
sample.” We further show that this implicit approach is unable to detect RPE use even among firms 
that claim to use RPE in setting executive pay. However, after incorporating disclosed peer-group 
composition, we find a significantly negative association between CEO pay and stock performance 
for disclosed peers, supporting the use of RPE. Moreover, we find that failure to account for the 
link between RPE-based performance targets and future peer performance hinders the detection of 
RPE use in implicit tests. These results illustrate that simplified assumptions about RPE contract 
details cloud inferences drawn from implicit tests. 

Turning to firms’ decisions to explicitly use RPE, we find that firms exposed to higher 
common risk, operating in less-concentrated industries, having fewer growth opportunities, and 
hiring less-wealthy CEOs are more likely to use RPE. These findings are in line with economic 
theories predicting that a firm’s use of RPE increases with the firm’s exposure to common risk and 
decreases with the CEO's self-hedging ability (e.g., Lazear and Rosen 1981; Holmstrom 1982; 
Garvey and Milbourn 2003). Moreover, firms that are larger, have more independent and larger 
boards, and hire compensation consultants are more likely to use RPE. These results reveal the 


| Examples include Antle and Smith (1986), Barro and Barro (1990), Gibbons and Murphy (1990), Janakiraman et al. 
(1992), Joh (1999), Aggarwal and Samwick (1999), Garvey and Milbourn (2003), Rajgopal et ај. (2006), and Albu- 
querque (2007, 2009). 

2 In untabulated results, using the implicit approach as per Albuquerque (2009), we replicate Albuquerque’s main finding 
and obtain a significantly negative coefficient on industry-size-matched peer returns over the 1992-2005 period. 
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importance of board structure and compensation consultants in facilitating the use of RPE. The 
overall evidence supports the view that firms consider both costs and benefits of RPE as an 
incentive mechanism when deciding to use RPE. 

Next, among firms using RPE in the pay-setting process, we examine whether firms choose 
RPE peers in a manner consistent with efficient contracting or rent extraction. Under the efficient- 
contracting perspective, the selection of RPE peers should consider potential peers’ ability to 
capture common risk (e.g., Holmstrom 1982). Tournament theory further suggests selecting RPE 
peers such that RPE firms’ and peer firms’ executives have equal chances to “win,” given similar 
effort levels (e.g., Lazear and Rosen 1981). Under the rent-extraction perspective, executives have 
incentives to choose RPE peers that they expect to underperform (e.g., Gibbons and Murphy 1990; 
Murphy 2001), as choosing peers with poor expected performance can inflate RPE firms' peer- 
adjusted performance and, hence, increase executive compensation. 

The evidence supports both efficient contracting and rent-extraction behavior in the peer- 
selection process. Consistent with efficient contracting, RPE firms tend to choose peers with 
higher common-risk-reduction ability and with greater similarity. These findings are generally 
more pronounced for RPE firms that outperform their industry than for underperforming RPE 
firms. Át the same time, we find that RPE firms are more likely to select peers that are expected 
to perform poorly (based on analysts' target price forecasts). Interestingly, this self-serving bias in 
peer selection is more significant among underperforming RPE firms. Taken together, our findings 
suggest that RPE firms' superior performance leads to efficient-contracting considerations, 
whereas RPE firms' underperformance leads to rent-seeking behavior in the peer-selection pro- 
cess. Executives in underperforming RPE firms likely have stronger incentives to behave oppor- 
tunistically in the peer-selection process because they are more concerned about potential reputa- 
tion loss and pay reduction as a result of poor performance. When we examine intertemporal 
changes in peer-group composition, we find some evidence that added peers are more similar in 
size to RPE firms, but exhibit poorer performance than dropped peers. 

Our study contributes to the executive compensation literature in three ways. First, our evi- 
dence facilitates a better understanding of the “КРЕ puzzle" that has long plagued researchers 
(Murphy 1999; Prendergast 1999). Our findings suggest that the RPE puzzle of mixed empirical 
support for RPE use arises at least in part from simplified assumptions underlying the traditional 
implicit approach. We demonstrate that incorporating RPE contract details (such as peer-group 
composition and the link between RPE-based performance targets and future peer performance) 
can significantly alter inferences drawn from implicit tests. Our evidence thus cautions against 
relying on the traditional implicit approach to draw definitive conclusions regarding RPE use. 

Second, we provide the first large-sample U.S. evidence on the explicit use of RPE. We find 
that about 25 percent of our sample firms use RPE in setting executive compensation. Our findings 
on firms’ decisions to use RPE confirm related economic theories, while also revealing the impor- 
tance of board structure and compensation consultants in facilitating RPE use. This stands in 
contrast to Carter et al. (2009), who do not find evidence in support of related economic theories 
when studying explicit RPE use in performance-vested equity plans for a sample of U.K. firms. 

Third, we provide the first large-sample evidence on the selection of RPE peers. We document 
that both efficient contracting (captured by common risk and economic similarity) and rent ex- 
traction (measured by expected peer performance) explain the likelihood of being selected as a 
RPE peer. These findings shed light on the execution of RPE contracts, which is essential to 
understanding the overall costs and benefits of RPE as an incentive device. 

Our evidence also adds to the growing literature on the use of peer groups in executive 
compensation contracts. Existing research predominantly focuses on peer groups used for 
compensation-benchmarking purposes (Bizjak et al. 2008, 2010; Cadman and Carter 2010; 
Albuquerque et al. 2009; Faulkender and Yang 2010). The purpose of compensation benchmarking 


The Accounting Review May 2011 
American Áccounting Association 


1010 Gong, Li, and Shin 


is to gauge the appropriate level of compensation, while the purpose of RPE is mainly to employ 
external performance targets (1.е., peer performance) to filter out common risk. Selecting better 
(worse) performing firms as benchmarking (RPE) peers is expected to increase executive com- 
pensation. In additional analysis, we find that, for firms disclosing both benchmarking peers and 
RPE peers, peers used solely for RPE (benchmarking) purposes exhibit higher (lower) perfor- 
mance comovement with the firms of interest and have worse (better) stock performance than 
unselected peers. These contrasting results indicate different considerations in selecting RPE peers 
versus compensation-benchmarking peers. Our study thereby fills the void in the extant literature 
that has paid little attention to RPE peers. 

We review the related literature in Section II. Section Ш describes the SEC’s new dinde 
rules, our sample-selection procedures, and descriptive statistics on RPE use for S&P 1500 firms. 
In Section IV, we use the implicit approach to evaluate RPE use, where we incorporate explicit 
RPE contract details. Section V examines firms' decisions to explicitly use RPE in executive 
compensation contracts, and Section VI investigates the selection of RPE peers. We conclude in 
Section VII. 


II. RELATED LITERATURE 
Literature on RPE Use | 


When common exogenous shocks affect the performance of multiple agents, economic theory 
predicts that an incentive scheme based on relative performance is superior to an incentive scheme 
based on individual performance if the agent is sufficiently risk-averse and common uncertainty is 
high (Lazear and Rosen 1981; Holmstrom 1982; Green and Stokey 1983). Since relative perfor- 
mance reduces the effect of common shocks on performance, RPE allows the risk-averse agent to 
bear less risk and the principal to better evaluate and motivate the agent’s effort. Besides this 
risk-sharing benefit, tournament theory proposes potential competition benefits of RPE. Under 
certain circumstances, an incentive scheme based on relative performance can result in higher 
performance compared to an incentive scheme based on individual performance (Nalebuff and 
Stiglitz 1983; Hannan et al. 2008). 

Despite its theoretical appeal, RPE involves potential costs. Heterogeneity among agents can 
dilute the benefits of RPE in a standard tournament (Lazear and Rosen 1981). If agents have 
unequal chances to win the prize given the same level of effort, a tournament can induce disad- 
vantaged agents to shirk (O’Keeffe et al. 1984) and might distort agents’ risk choices (e.g., Rosen 
1986; Knoeber and Thurman 1994; Hvide 2002). Prior studies also suggest that RPE contracts 
create adverse incentives for agents to sabotage peer performance, collude with peers, and/or 
choose inappropriate peer groups (Dye 1984; Gibbons and Murphy 1990; Aggarwal and Samwick 
1999; Murphy 2001). Moreover, empirical evidence is consistent with competitive environments 
and certain executive attributes (such as self-hedging ability and outside employment opportuni- 
ties) reducing the benefits of RPE in incentive contracting (e.g., Aggarwal and Samwick 1999; 
Garvey and Milbourn 2003; Rajgopal et al. 2006). The theory and evidence cited above provide 
potential explanations for a lack of consistent empirical evidence supporting the use of RPE in 
executive compensation contracts (see Murphy [1999] and Prendergast [1999] for reviews of 
related literature). 

While the majority of empirical studies employ the implicit approach to test RPE use, a few 
studies have examined explicit RPE use in executive compensation contracts (Murphy 1999, 2001; 
Bannister and Newman 2003; Carter et al. 2009). Murphy (1999) examines 177 large U.S. firms 
included in the 1997 Towers Perrin survey, and Bannister and Newman (2003) examine proxy 
disclosures of 160 Fortune 250 firms in fiscal years 1992 and 1993. Both studies provide descrip- 
tive statistics about explicit RPE contracts and claim that a lack of empirical support for RPE use 
could be due to incorrect assumptions and model misspecifications in implicit tests. We investigate 
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this claim in Section IV. A more recent study by Carter et al. (2009) examines explicit RPE use in 
performance-vested equity grants for a sample of 129 U.K. firms. Their analysis of firms’ choice 
between RPE use and non-RPE use yields little empirical support for related economic theories.” 
Their U.K. evidence might not generalize to the U.S. setting, however, because performance- 
vested equity plans are more prevalent in the U.K., mainly due to external pressures from insti- 
tutional investors (Carter et al. 2009; Ferri 2009). 


Literature on RPE Peer Selection 


The selection of RPE peers has significant implications for the costs and benefits of RPE, but 
few theoretical models specifically analyze the selection process. Nevertheless, theories on RPE 
use provide guidance on how to select RPE peers efficiently. To better shield executives from 
common exogenous shocks (Lazear and Rosen 1981; Holmstrom 1982; Green and Stokey 1983), 
selected peers should bear greater common risk with the RPE firms. Moreover, in a tournament, 
selecting agents with similar ability can reduce potential inefficiencies induced by unequal con- 
tests, such as shirking and suboptimal risk choices (e.g., Lazear and Rosen 1981; Hvide 2002). 
Hence, RPE firms should select peers that exhibit similar ability as themselves. 

On the other hand, executive compensation practices are often the product of “managerial 
power,” which allows rent extraction from shareholders (Bebchuk and Fried 2003). In the context 
of RPE, inefficient peer selection could involve deliberate selection of poorly performing firms as 
RPE peers to inflate peer-adjusted firm performance and thus allow executives to reap excess 
compensation (Gibbons and Murphy 1990; Murphy 2001). Another line of research proposes that 
executive compensation practices reflect symbolic considerations (Westphal and Zajac 1994; Zajac 
and Westphal 1995), suggesting that RPE firms prefer visible and well-established peers to facili- 
tate the justification of their choices to external constituencies. 

Due to limited disclosures about RPE peer groups, prior empirical studies generally assume 
that RPE peers comprise all firms in the same industry as the RPE firms. Albuquerque (2009) 
points out that RPE peer selection should consider both industry and size effects, and reports 
stronger evidence of RPE use when peers are matched on both industry and size, as opposed to 
industry only. Dikolli et al. (2010) examine the consequences of the implicit test’s departure from 
the details in explicit RPE contracts. They demonstrate analytically that an empiricist’s inability to 
observe the board’s actual peer selection and aggregation of peer performance creates measure- 
ment errors that can significantly bias against detecting RPE use. Their theoretical proposition is 
supported by simulation results. These studies highlight the importance of peer-group composition 
in drawing inferences from traditional implicit tests. 


III. INSTITUTIONAL BACKGROUND, SAMPLE, AND DESCRIPTIVE STATISTICS 

New Disclosure Rules on Executive Compensation 

Amid increased media and shareholder scrutiny of executive pay practices, the SEC amended 
its rules on executive compensation disclosures in 2006 to increase the transparency of executive 
compensation contracts. The new disclosure rules were initially proposed in January 2006 and are 
effective for fiscal years ending on or after December 15, 2006 (see Appendix A for a timeline 
related to the new disclosure rules). Prior to 2006, proxy disclosures on the details of RPE 
contracts in the U.S. had been voluntary (Byrd et al. 1998; Carter et al. 2009). Under the new 
disclosure rules, firms are required to provide a "Compensation Discussion and Analysis" 
(CD&A) report in their proxy statements, in which firms must provide a detailed description of the 


: Specifically, Carter et al. (2009) find no evidence that economic determinants of RPE use (such as common risk 
exposure) affect firms’ decisions to use RPE in performance-vested equity grants as predicted by economic theories. 
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process used to set performance targets and an evaluation of how the performance targets translate 
into an objective compensation determination.’ This setting allows us to collect detailed data on 
explicit RPE contracts for a large sample of U.S. firms. 

Mandatory disclosures of RPE use, however, could alter firms’ decisions to use RPE and/or 
the implementation of RPE contracts. To the extent that the new disclosure rules attract public 
attention concerning the executive pay-setting process, firms could choose to restrain rent-seeking 
activities and thus potentially improve the compensation-contracting efficiency. Consequently, our 
findings might understate (overstate) the extent of rent seeking (efficient contracting) in periods 
prior to this new disclosure regulation. 


Sample Selection 

We begin with the S&P 1500 firms identified using Compustat’s annual file for fiscal year 
2006. We retrieve these firms’ first proxy statements filed under the new executive compensation 
disclosure regime.” Our initial sample consists of 1,419 firms with available proxy statements. 

We determine the use of RPE in executive compensation contracts by reading CD&A reports. 
Specifically, if the firm states that at least one component of executive compensation (e.g., annual 
bonus, restricted stock, stock option) is determined based on firm performance relative to a group 
of peers, the firm is identified as a RPE firm; otherwise, the firm is identified as a non-RPE firm. 
We take particular care to avoid misclassifying compensation-benchmarking firms as RPE firms. 
Specifically, if a firm uses peer groups to solely benchmark the level of pay without using peer 
groups to determine the performance target when evaluating firm’s performance, then the firm is 
coded as a non-RPE firm. Appendix B provides detailed coding criteria and representative proxy 
disclosures about RPE contracts in our sample. Our initial sample includes 361 RPE firms and 
1,058 non-RPE firms. When firms indicate that RPE is used in setting executive pay, they usually 
disclose details of the RPE contracts, such as the composition of the RPE peer group and specific 
performance metrics used for RPE.° We collect these detailed RPE contract disclosures as de- 
scribed in the CD&A section. 

Finally, we obtain CEO compensation data from ExecuComp, financial data from Compustat, 
stock return data from CRSP, and analysts’ target price forecasts from I/B/E/S. 


* SEC final rule 33-8732a, Item 402(b) (2) (v-vi) states, “In particular, firms need to disclose material information on 
what specific items of corporate performance are taken into account in setting compensation policies and making 
compensation decisions... how specific forms of compensation are structured and implemented to reflect these items of 
the registrant’s performance, including whether discretion can be or has been exercised (either to award compensation 
absent attainment of the relevant performance goal(s) or to reduce or increase the size of any award or payout), 


identifying any particular exercise of discretion, and stating whether it applied to one or more specified named executive’ 


officers or to all compensation subject to the relevant performance goal(s).” Nontransparent or vague disclosures of RPE 
use may attract SEC attention to be publicized in the SEC’s comment letters (see, e.g., the SEC’s comment letter on 
Blackbaud Inc. proxy filings dated October 28, 2008). 

* Most firms are subject to the new disclosure rules in fiscal year 2006. Since the new rules are effective for fiscal years 
ending on or after December 15, 2006, and firms are required to file their proxy statements within 120 days after the 
fiscal year-end, some firms are not subject to the new rules until fiscal year 2007. In our initial sample of 1,419 firms, 
37 firms fall into this category, among which 8 firms use RPE. For these firms, we collect information on the use of RPE 
from fiscal year 2007 proxy statements. For ease of exposition, we use 2006 to refer to the first year under the new 
disclosure rules throughout the paper. . 

6 Firms occasionally comment favorably on the principle of КРЕ, but do not disclose details on the use of RPE in setting 
compensation (e.g., no disclosure about the compensation components and performance metrics used for RPE and no 
details on the RPE peer group; see the last example in Appendix B). We classify these firms as non-RPE firms (66 

` firms), as under the SEC's new disclosure regime, firms are required to disclose the use and details of RPE contracts if 
they actually use RPE in setting executive compensation. Our results on firms’ decisions to use RPE are qualitatively 
similar if we exclude these 66 non-RPE firms from the analysis (untabulated). 
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Descriptive Statistics 


Table 1 provides descriptive evidence on the frequency of RPE use and on the main features 
of RPE contracts, Panel A reports that 361 (about 25 percent) of the sample firms use RPE at least 
to some extent in determining executive compensation. For comparison, Bannister and Newman 
(2003) report that 28 percent of their sample firms (which include 160 firms listed on the Fortune 
250 index) use RPE over the 1992-1993 period. For S&P 500 firms, which have comparable size 
to Bannister and Newman’s (2003) sample, we find a higher frequency of RPE use (about 38 
percent), which reflects increased RPE disclosure under the SEC’s new disclosure regime and/or 
more widespread use of RPE since the early 1990s.’ 

The fraction of RPE firms appears small compared to a theoretical prediction that RPE should 
be a widespread practice. We offer two explanations. First, we are only able to observe firms 
рге-соп пр to a formulaic explicit RPE contract, thereby excluding firms that use RPE im- 
plicitly through boards’ subjective discretion (e.g., Ferri 2009). Second, as mentioned in Section 
Ц, RPE also involves costs, so that firms are apt to use RPE only when the perceived benefits of 
RPE sufficiently outweigh the costs associated with using RPE. 

Table 1, Panel B reports the use of RPE by the type of compensation plans and performance 
metrics. As can be seen, firms most often apply RPE solely to equity-based compensation plans 
(214 firms, or 59 percent) and less frequently to cash compensation plans (82 firms, or 23 percent). 
This finding stands in contrast to the evidence reported by Murphy (1999), based on a survey of 
177 large U.S. firms in 1997 that RPE is sometimes used in annual bonus plans but rarely in 
equity-based compensation plans. Our finding also indicates heavier use of RPE toward equity- 
based compensation as compared with Bannister and Newman (2003), who report that 44 percent 
of their sample firms use RPE for long-term plans (including restricted stock, stock options, and 
long-term incentive plans), based on 160 Fortune 250 firms in the early 1990s. We also find a 
small percentage of firms (65 firms, or 18 percent) that use RPE in setting both equity-based 
compensation and cash compensation. 

Turning to performance metrics used in RPE contracts, Panel B of Table 1 shows that a 
majority of RPE firms (206 firms, or 57 percent) rely solely on price-based performance metrics 
(including stock returns and common shareholder wealth) when implementing RPE. Accounting 
performance metrics (such as return on equity and earnings per share) are also common in RPE 
contracts (82 firms, or 23 percent). Price-based RPE metrics are primarily used in equity-based 
compensation plans (150 firms out of 206 firms, or 73 percent), while cash compensation plans 
more frequently use accounting-based RPE metrics (35 firms out of 82 firms, or 43 percent). 
Among the remaining RPE firms, 60 firms (17 percent) use both price-based and accounting 
performance metrics for RPE purposes. An even smaller percentage of firms (13 firms, or 4 
percent) use nonfinancial performance metrics (such as customer satisfaction and market share) in 
RPE contracts. The infrequent use of nonfinancial metrics potentially reflects difficulties in ob- 
taining peer-firms’ nonfinancial information (e.g., Ferri 2009). We list commonly used RPE per- 
formance metrics in Table 1, Panel C. 

Firms that use RPE in setting equity-based compensation can employ RPE in a number of 
ways. Table 1, Panel D shows that a significant portion of RPE firms (about 43 percent) use RPE 


To better understand the effect of the new disclosure rules on firms’ disclosures of RPE use, we compare S&P 500 firms’ 
RPE disclosures before and after the new disclosure rules. For the 490 S&P 500 firms in our sample, the percentage of 
RPE firms in 2005 is about 31 percent (153 firms), compared to 38 percent (185 firms) in 2006. This result appears 
consistent with more firms adopting RPE in anticipation of more public scrutiny under the new disclosure rules and is 
also consistent with increased disclosure of RPE use under the new disclosure regime. 
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TABLE 1 
Descriptive Statistics of Relative Performance Evaluation Use in Executive 
Compensation Plans 
Panel A: Relative Performance Evaluation Use" 
Number of Firms Percentage (%) 

RPE firms 361 25.44 
Non-RPE firms 1,058 74.56 
Total 1,419 100.00 





Panel В: Relative Performance Evaluation Use by Type of Compensation Plan and Performance 
Metrics 
Price- 
Based and 
Price-Based Accounting Accounting 
Metrics Metrics Metrics Others Total 


Equity-based compensation plan only 





Restricted stock 126 26 17 3 172 
Stock option 3 1 1 0 5 
Restricted stock and stock option awards 21 7 6 3 37 
Subtotal 150 34 24 6 214 
Cash compensation plan only 32 35 12 3 82 
Equity-based and cash compensation plans 24 13 24 4 65 
Total 206 82 60 13 361 





Panel С: Commonly Used Price-Based and Accounting Performance Metrics for Relative 
Performance Evaluation® 


Number of Firms Percentage (%) 


Stock returns 266 73.68 
Return on equity 50 13.85 
Growth of earnings 42 11.63 
Earnings 39 10.81 
Growth of sales 24 6.65 
Return on asset 14 3.88 
Cash flow 13 3.60 


Panel D: Details of Relative Performance Evaluation Use in Equity-Based Compensation Plan? 
Number of Firms Percentage (96) 


RPE determines 
Vesting condition of performance share units (type 1) 120 43.01 
Size of restricted stocks and stock options (type 2) 94 33.69 
Vesting condition of restricted stocks and stock options (type 3) 61 21.86 
Both type 1 and type 3 2 0.72 
Both type 2 and type 3 2 0.72 
Total 279 100.00 
(continued on next page) 
The Accounting Review May 2011 


American Accounting Association 


Relative Performance Evaluation and Related Peer Groups 1015 


Panel Е: Relative Performance Evaluation Use by Type of Compensation Plan and Peer Group 
Choice 


Self- Market/Industry 
Selected Market/Industry Index and Self- 
Peers Index Selected Peers Total 
Equity-based compensation plan only 
Restricted stock 91 72 9 172 
Stock option 4 1 0 5 
Restricted stock and stock option awards 21 10 6 37 
Subtotal 116 83 15 214 
Cash compensation plan only 51 25 6 82 
Equity-based and cash compensation plans 42 19 4 65 
Total 209 127 25 361 


Panel F: Number of Peers in the Self-Selected Peer Groups and Industry Similarity and S&P 1500 
Identity for Self-Selected Peers 





Standard 
Mean Deviation 25% Median 75% п 
Number of self-selected peers 14.694 8.721 9 13 18 232 


Percentage of self-selected peers in the same 
two-digit SIC industry as the RPE firms (%) 60.92 33.39 34.85 66.67 91.67 232 


Percentage of self-selected peers in the same 
three-digit SIC industry as the RPE firms (%) 47.92 34.93 14.84 45.45 83.33 232 
Percentage of self-selected peers in the S&P 1500 


index (%) 70.37 22.42 58.58 7460 86.36 232 
Percentage of self-selected peers in the same 
S&P 1500 sub-index as the RPE firms (%) 45.11 27.77 23.67 40.00 67.71 232 


* ЕРЕ (Non-RPE) firms include 361 (1,058) S&P 1500 firms listed in the Compustat annual file for 2006 that explicitly 
disclose (do not mention) the use of relative performance evaluation in executive compensation plans in their first 
annual proxies filed under the SEC's 2006 executive compensation disclosure rules. We are unable to find proxy 
statements for 81 S&P 1500 firms. 

> “Equity-based compensation plan only" category includes firms that use relative performance evaluation in setting 

equity-baséd compensation (such as restricted stock and stock option). "Cash compensation plan only" category Ш- 

cludes firms that use relative performance evaluation in setting cash compensation (such as annual bonus and long-term 

incentive payout). "Equity-based and cash compensation plans” category includes firms that use RPE in both equity- 
based and cash compensation plans. 

"Price-Based Metrics" category includes firms that use price-based performance metrics (such as stock returns and 

shareholder wealth) to implement relative performance evaluation in compensation plans. “Accounting Metrics" cat- 

egory includes ürms that use accounting performance metrics (such as return-on-equity, earnings per share, earnings 
growth) to implement relative performance evaluation in compensation plans. “Price-Based and Accounting Metrics” 
category includes firms that use both price-based and accounting performance metrics to implement relative perfor- 
mance evaluation in compensation plans. “Others” category includes firms that use nonfinancial performance metrics 

(such as customer satisfaction and market share) to implement relative performance evaluation in compensation plans. 

See Panel C for a more detailed breakdown of performance metrics. 

The percentages sum to more than 100 percent as some firms employ multiple performance metrics when applying 

relative performance evaluation in compensation plans. 

“Vesting condition of performance share units" category includes firms that use RPE to determine the vesting of 

"performance share units (PSU)." Performance shares are similar to restricted stocks in that they fully vest at the end of 

а certain performance period starting with the grant year (usually three years). However, the range of vesting amount for 

PSU is usually between 0 percent and 200 percent (sometimes 150 percent), which is different from that for restricted 

stock units (RSU) capped at 100 percent. “Size of restricted stocks and stock options” category includes firms that use 

RPE to determine the size of granted restricted stocks and stock options. “Vesting condition of restricted stocks and 


(continued on next page) 
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stock options” category includes firms whose stock options or RSUs are vested conditioned on achievement of RPE 
goals. This category also includes firms for which the vesting schedule of stock options or restricted stock is accelerated 
if the company achieves a certain RPE target. 

“Self-Selected Peers” category includes firms that chose individual firms as the peer groups to implement relative 
performance evaluation in compensation plans. “Market/Industry Index” categcry includes firms that chose published 
market/industry indices as the peer groups to implement relative performance evaluation in compensation plans. 
“Market/Industry Index and Self-Selected Peers” includes firms that chose both published market/industry indices and 
individual firms as the peer groups to implement relative performance evaluaticn in compensation. 

Two firms mentioned the use of self-selected peers in implementing relative performance evaluation in executive 
compensation plans, but did not disclose the composition of the RPE peer group, which reduces the sample size to 232 
firms. On a firm level, there are 39 (25) RPE firms whose self-selected peer firms all come from the same two-digit 
(three-digit) SIC industry; there are 25 (10) RPE firms whose self-selected peer firms all come from the S&P 1500 index 
(the same S&P 1500 sub-index) as the RPE firms. 


a 


to determine the vesting of performance share units. A number of firms (about 34 percent) use 
RPE to determine the size of restricted stock and stock option grants. Other firms (about 22 
percent) link the achievement of RPE performance target to vesting conditions of restricted stocks 
and stock options. These statistics indicate that the use of RPE in equity grants now commonly 
involves vesting conditions among U.S. firms. 

Finally, we examine the choice and composition of RPE peer groups. Table 1, Panel E reports 
that a majority of RPE firms (209 firms, or 58 percent) use self-selected RPE peer groups, and 
about one-third of RPE firms (127 firms, or 35 percent) employ published market or industry 
indices. A small number of RPE firms (25 firms, or 7 percent) use both published indices and 
self-selected peer groups in RPE contracts. The tendency to use self-selected peer groups (as 
opposed to published indices) does not seem to differ notably across equity-based compensation 
plans and cash compensation plans. 

Table 1, Panel F shows that the mean (median) number of peers in self-selected peer groups 
is about 15 (13) firms, in contrast to a mean (median) of 60 (65) fixms in these firms’ correspond- 
ing two-digit SIC industry. Moreover, only 61 percent (48 percent? of self-selécted peers operate 
in the same two-digit (three-digit) SIC industry as the RPE firms. At the firm level, there are only 
39 (25) RPE firms whose self-selected peers all come from the same two-digit (three-digit) SIC 
industry. These statistics reinforce the concern that the key assumption underlying implicit tests— 
that the RPE peer group consists of all firms from the same industry—is inaccurate. Finally, about 
70 percent of self-selected peers belong to the S&P 1500 index, and about 45 percent share the 
same S&P sub-index (S&P 500, Mid-Cap 400, and Small-Cap 600) as the RPE firms. At the firm 
level, there are 25 (10) RPE firms whose self-selected peers all come from the S&P 1500 index 
(the same S&P 1500 sub-index). 

Table 2 provides descriptive statistics on firm characteristics, management attributes, and 
corporate governance for the sample firms. As shown, firms that disclose RPE use in executive 
compensation contracts differ from non-RPE firms along many dimensions, including competitive 


- * Performance shares are similar to restricted stocks in that they fully vest at the end of a specified performance period, 
starting with the grant year (usually three years). However, the vesting amount for performance shares ranges between 
0 percent and 200 percent (see the example of PG&E Corp in Appendix B), whereas that for restricted stocks is capped 
at 100 percent. 

9 А possible reason for a RPE firm to select peers outside its (primary) industry is that the RPE firm follows diversified 
business strategies and operates with multiple segments, which requires a diversified peer group to appropriately 
benchmark firm performance. We find that, of the 232 RPE firms that have self-selected peer groups, 157 firms (about 
68 percent) operate with multiple business segments. Multiple-segment RPE firms have roughly 61 percent (45 percent) 
of peers selected from the same two-digit (three-digit) SIC industry, while single-segment RPE firms have about 72 
percent (65 percent) of peers selected from the same two-digit (three-digit) SIC industry. 


The Accounting Review May 2011 
American Accounting Association 


Relative Performance Evaluation and Related Peer Groups 


1017 


TABLE 2 
Descriptive Statistics of the Sample 
Non-RPE Firms RPE Firms 
(n = 1,058) (n = 361) Meng tdem 
Mean Median Mean Median Differences Differences 
Firm Characteristics 
Common, Risk 0.285 0.257 0.384 0.370 —0.099*** — —0,113*** 
\Size_Rkadjl 5.348 5.081 6.325 5.969 —0.977*** —0,888*** 
IDiversity. Rkadjl 0.019 0.007 0.024 0.015 —0.005*** — —0.008*** 
IReturn, Rkadjl 0.049 0.024 0.040 0.022 0.009** 0.002 
Industry_Concentration 0.124 0.085 0.110 0.072 0.014** 0.013*** 
BM 0.600 0.602 0.682 0.692 —0.082*** —0,090*** 
Size 7.654 7.466 8.484 8.355 —0.830*** —0,889*** 
ROA_Indadj 0.014 0.000 0.005 0.000 0.009 0.000 
Return_Indadj 0.017 —0.007 0.019 -0001 —0.002 —0.006 
CEO Attributes | 
CEO_Wealth 8.972 9.868 9.435 10.011 —0.463 —0.143 
CEO. Age 55.882 56.000 56.053 56.000 —0.171 0.000 
Corporate Governance 
Top5_Instown 0.392 0.372 0.374 0.353 0.018 5 0.019** 
Activist_Instown 0.028 0.028 0.028 0.029 0.000 —0.001** 
CEO/Chair 0.601 1.000 0.659 1.000 —0.058* 0.000 
Board_Independence 0.817 0.857 0.857 0.889 —0.040*** — —0,032*** 
Board_Size 8.957 9.000 10.518 10.000 —1.561***  —1,000*** 
CompConsultant 0.798 1.000 0.904 1.000 —0.106*** 0.000 


Ж *% ЖУЖ Indicate significance at less than the 10 percent, 5 percent, and 1 percent levels, respectively, based on 
two-tailed t-tests (z-tests) on mean (median) differences. 

The sample includes S&P 1500 firms listed in the Compustat annual file for 2006 that have available proxy statements from 
the SEC’s EDGAR database. RPE (Non-RPE) firms include 361 (1,058) S&P 1500 firms that explicitly disclose (do not 
mention) the use of relative performance evaluation in executive compensation plans in their first annual proxies filed 
under the SEC’s 2006 executive compensation disclosure rules. А T 
Variables are measured over or at the end of fiscal year 2005 (2006) if a firm's first proxy filed.under the SEC's 2006 
executive compensation disclosure rules is for fiscal year 2006 (2007). Firm characteristics and CEO attributes (except 
Industry Concentration and СЕО Age) are winsorized at the top and bottom one percentiles. Number of observations 
varies depending on data availability. 


Variable Definitions: 
Common, Risk — proportion of firm-level stock return variance that is explained by value-weighted industry 
Stock returns, measured by the R? from regressing firm-level stock returns on value-weighted 
industry stock returns over the prior 36 months; 
\Size_Rkadjl = natural logarithm of the absolute difference between the firm's market value of equity and 
the median market value of equity for the firm's corresponding decile; 

IDiversity Rkadj| = natural logarithm of the absolute difference between the firm's segment concentration and the 
median segment concentration for the firm's corresponding decile. Segment concentration is 
the sum of the squares of each segment's sales as a percentage of the total firm sales; 

lReturn_Rkadjl = absolute difference between the firm's annual stock returns and the median annual stock 
returns for the firm's corresponding decile; 
Industry. Concentration = sum of the squares of the market shares of the firms’ sales within each two-digit SIC 
ind . 


ВМ = book value of assets divided by the sum of the market value of equity and book value of 
liabilities; 
Size = natural logarithm of the market value of equity; 
(continued on next page) 
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TABLE 2 (continued) 


ROA_Indadj = return-on-assets minus the median return-on-assets for the same industry (two-digit SIC 
code); 
Return_Indadj = buy-and-hold annual stock returns minus the median buy-and-hold annual stock returns for 
the same industry (two-digit SIC code); 
CEO_Wealth = natural logarithm of the value of equity (including both stocks and stock options) held by the 
CEO; 
CEO_Age = age of the CEO; 

CompConsultant = 1 if the firm uses compensation consultant in the pay-setting process (as disclosed in their 
first annual proxies filed under the SEC’s 2006 executive compensation disclosure rules), 
and 0 otherwise; 

Top5_Instown = stock ownership by the top five institutions as a percentage of total institutional ownership; 

Activist_Instown = percentage of holdings by activist institutions as defined by Cremers and Nair (2005); 

CEO/Chair = 1 if the СВО serves as Chairman of the Board, and 0 otherwise; 
Board, Independence = percentage of independent directors serving on the board; and 
Board_Size = number of directors serving on the board. 


environment, growth opportunities, common risk exposure, and market capitalization. In addition, 
RPE firms have different ownership structure and board attributes than non-RPE firms, and more 
frequently employ compensation consultants when setting executive compensation. We consider 
these differences when conducting multivariate analyses in Section V. 


IV. INCORPORATING EXPLICIT RPE CONTRACT DETAILS IN IMPLICIT TESTS 

Prior empirical studies generally employ an implicit approach to test for the prevalence of 
RPE use without referring to explicit RPE contracts. In this section, we examine the extent to 
which incorporating details of explicit RPE contracts affects inferences drawn under the implicit 
approach. Our approach, while incorporating important features of RPE contracts, still relies on 
assumptions regarding how RPE is actually utilized in setting executive рау.? Nevertheless, such 
evidence would highlight limitations of implicit tests and underscore the importance of consider- 
ing explicit RPE contract details in drawing conclusions about RPE use. 

We start by following Albuquerque (2009)’s implicit approach with one exception. 
Albuquerque (2009) included CEO fixed effects for her multiple-year sample period of 1992- 
2005. We estimate the following equation using S&P 1500 firms, where we limit the analysis to 
2006 because we only collect information about explicit RPE contracts for 2006: 


СЕОРау, = ag + a, FirmPerf,+ a,PeerPerf,+ osControlVariables, + e,. (1) 


In Equation (1), the dependent variable, CEOPay, is CEO total compensation, measured as 
the sum of salary, bonus, the fair value of stock option awards and restricted stock awards (per 
SFAS 123R), the change in deferred compensation, non-equity incentive plan compensation, and 
other compensation, in thousands of dollars (і.е., ТРСТ from ExecuComp). To mitigate skewness 
in CEO total compensation, we follow prior studies and use its natural logarithm in the estimation 
(e.g., Murphy 1999; Albuquerque 2009). The independent variables, FirmPerf, and PeerPerf, are 
12-month buy-and-hold stock returns during the fiscal year for the firm and its RPE peer group, 
respectively. Albuquerque (2009) proposes firm size as an important factor in RPE peer selection, 
while prior implicit tests utilize misspecified peer groups by combining same-industry firms with- 


10 Бог instance, we assume that all firms use stock return as the RPE-based performance metric and the median perfor- 
mance among peers determines the hurdle rate in achieving RPE-based performance targets. 
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out considering firm size. We follow Albuquerque’s (2009) recommendation and measure 
PeerPerf, based on the median stock return for firms in the same size quartile and same industry. 
If firms filter out peer performance in setting CEO pay, then we expect to find a, <0. 

Equation (1) also controls for firm characteristics and governance attributes that can influence 
CEO pay, including firm size, growth options, CEO tenure, regulated industry membership, idio- 
syncratic variance, CEO/Chair duality, CEO stock ownership, and whether the CEO is involved in 
an interlocking relationship. To account for potential outliers, we winsorize non-dichotomous 
variables at top and bottom one percentiles (except that idiosyncratic variance is winsorized at the 
top one percentile only). We further include industry indicator variables to account for variations 
in CEO pay at the industry level. 

Table 3 presents the results from estimating Equation (1). The first column reports results for 
our full sample of S&P 1500 firms. The sample size is smaller than that reported in Table 1, Panel 
A (reduced by 291 firms) due to (1) additional sample restrictions (excluding new CEOs who take 
the position in the middle of 2006 and excluding firms reporting negative total compensation), and 
(2) missing values on control variables. As shown, the coefficient on industry-size-matched peer 
returns is insignificant, inconsistent with RPE use.!! The lack of evidence in support of RPE use 
could result from infrequent use of RPE in 2006, as only 25 percent of our sample firms explicitly 
disclose the use of RPE in setting executive pay. If the infrequent use of RPE underlies the 
insignificant result above, then we expect to observe stronger evidence on RPE use for RPE firms 
than for non-RPE firms. Columns (2) and (3) report the results for non-RPE firms and RPE firms, 
respectively. We find that for both non-RPE firms and RPE firms, the coefficient on industry-size 
matched peer returns is insignificant. Hence, even among firms explicitly disclosing RPE use, the 
implicit test provides no evidence that these firms filter out peer performance when setting CEO 
pay. These results indicate that the infrequent use of RPE is unlikely to be the dominant reason for 
the insignificant evidence on RPE use in the implicit test. 

To further refine the implicit test, we consider disclosed RPE contract details. First, we 
incorporate RPE peer-group composition by measuring PeerPerf, based on the median stock 
return of self-selected RPE peers as disclosed in firms’ 2006 proxy statements. Table 3, Column 
(4) shows that the coefficient on self-selected peer returns is significantly negative, consistent with 
firms using these self-selected peers for RPE purposes when setting CEO pay. The contrasting 
results across Columns (3) and (4) highlight the importance of peer-group composition in testing 
for RPE use.’ The implicit test is likely to produce misleading results due to inaccurate identifi- 
cation of RPE peers employed in the pay-setting process. 

Second, we consider performance-based vesting, a design feature of RPE contracts that has 
gained popularity in the United States in recent years (Gerakos et al. 2007). The traditional 
implicit tests use current performance for RPE firms and their peers to explain executive pay. 
However, as Table 1, Panel D documents, firms often link RPE-based performance targets to 
vesting conditions of equity grants, so that future performance determines the value of equity 
grants (Gerakos et al. 2007; Carter et al. 2009). Under SFAS 123R, firms are required to report the 
fair value of performance-vested equity grants based on estimates of future performance for both 


И This result appears to be inconsistent with Albuquerque's (2009) main finding of a negative coefficient on industry- 
size-matched peer returns. This inconsistency is driven by our different sample period: Albuquerque’s (2009) sample 
period ranges from 1992 to 2005, while our. sample period is limited to 2006. 

12 The sample size under Column (3) is larger than the sample size under Column (4) because Column (4) focuses attention 
on RPE firms disclosing self-selected RPE peers. Results are qualitatively similar if we restrict to RPE firms disclosing 
self-selected RPE peers in Column (3). The sample size under Column (4) is smaller than the sample size reported in 
Table 1, Panel E (i.e., 209 firms with self-selected RPE peer groups) because of additional sample restrictions (excluding 
new CEOs who take the position in the middle of 2006 and excluding firms reporting negative total compensation) and 
missing values on control variables. 
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the RPE firms and their peers, as well as the associated likelihood of future vesting realizations. 
Because compensation committees’ projections of future performance can differ from current 
performance, the use of current performance in testing for RPE use could produce low-power tests 
7 when RPE-based performance targets are linked to future performance rather than current perfor- 
mance. The last two columns of Table 3 examine this issue. As shown in Column (5), when RPE 
use is linked to current peer performance, the coefficient on self-selected peer returns is signifi- 
cantly negative, supporting the use of RPE. In contrast, Column (6) shows that, when RPE use is 
linked to future peer performance (through equity grants’ vesting conditions), there is no evidence 
of RPE use. Hence, failure to consider this feature of RPE contracts is likely to confound empirical 
results based on implicit tests of RPE use.” 


У. USE OF RPE IN EXECUTIVE COMPENSATION CONTRACTS 
Prior studies have proposed a variety of factors that affect firms’ decisions to use RPE in 
executive compensation contracts. However, these studies rely on the implicit approach and 
mostly examine one contextual factor at a time without controlling for alternative factors. In this 
section, we employ explicit disclosures on RPE use to simultaneously examine multiple factors 
that influence the decision to incorporate RPE into executive compensation contracts. 


Determinants of RPE Use 
Common Risk 


Theory predicts that the risk-sharing benefit of RPE increases in the extent to which common 
risk affects both firm and peer performance (e.g., Holmstrom 1982).'* We thus expect firms to be 
more likely to use RPE when their performance is more heavily influenced by common risk. We 
measure common risk based on the proportion of the variance of firm-level stock returns that can 
be explained by industry returns, defined as the В? from regressing firm-level stock returns on 
value-weighted industry stock returns over the prior 36 months (Common_Risk). 


Availability of Similar Peers 


Tournament theory implies that including firms with unequal ability in the RPE peer group 
could induce executives to shirk or to pursue inefficient investments. Firms are thus less likely to 
use RPE if potential peers all exhibit dissimilar ability. We argue that firm size and industry 
diversification reflect organizational complexity, and firm performance reflects existing economic 
conditions, all of which affect executives’ ability to compete with potential peers. To measure the 
availability of peers with similar ability, we first divide firms within the same two-digit SIC 
industry into deciles based on market value of equity, segment concentration defined as the 
Herfindahl-Hirschman Index of segment sales (Rose and Shepard 1997; Bushman et al. 2004), or 
annual stock returns. The availability of similar peers is then measured by the absolute differences 
in market value of equity, segment concentration, and annual stock returns, respectively, between 
the firm and the median of the decile to which the firm belongs (ISize Rkadjl, |Diversity_Rkadjl, 
and |Return_Rkadj|, respectively). 


З Other features of RPE contracts relate to compensation components, performance metrics, hurdle rates in achieving 
RPE-based performance targets (e.g., median, 25 percent, or 75 percent of peer group performance), etc. Combining all 
of the features of RPE contracts in studying RPE use would substantially reduce our sample size and, hence, is not 
feasible to implement. 

^ While it is desirable to filter out common shocks for better risk-sharing, some firms may deliberately choose not to use 
RPE to motivate executives to manage risks resulting from such exogenous shocks (e.g., hedging in oil and gas 
industry). 
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Industry Competition 

Aggarwal and Samwick (1999) and Joh (1999) argue that firms facing a more competitive 
environment are less likely to use RPE due to the concern that RPE may encourage destructive 
competition. Following these studies, we use industry concentration, measured as the Herfindahl- 
Hirschman Index of sales within each two-digit SIC industry, as an inverse proxy for industry 
competition (/ndustry Concentration). 


Growth Opportunities 

Murphy (2001) predicts that firms with high-growth opportunities are more likely to adopt 
external standards (such as peer performance) in setting executive pay because internal standards 
(such as historical earnings growth) provide managers of high-growth firms stronger incentives to 
smooth performance in response to fears of budget ratcheting (Leone and Rock 2002). We use the 
book-to-market ratio (BM) as an inverse proxy for growth opportunities. 
Size 

Himmelberg and Hubbard (2000) use firm size to capture CEO talent. They propose that the 
reservation wage for talented CEOs covaries with the state of the economy, such that firms are less 
likely to filter out industry and market-wide performance when compensating talented CEOs (also 
see Rajgopal et al. 2006). Alternatively, firm size could serve as a crude proxy for public scrutiny 
and shareholder concerns about executive pay practices (Bannister and Newman 2003), which 


may induce firms to commit to explicit RPE use. We use the market value of equity to measure 
firm size (Size). 


Firm Performance 

Firm performance could also proxy for CEO talent and, hence, could be negatively related 
with RPE use (Himmelberg and Hubbard 2000). Alternatively, firms having superior performance 
could prefer RPE as a convenient tool to justify higher executive pay. We include industry- 
adjusted operating performance (КОА Indadj) and stock performance (Return Indadj) to capture 
performance effects on RPE use. 


Executive Attributes 


Garvey and Milbourn (2003) argue that the theoretical benefit of RPE is diminished to the 
extent that agents can hedge market risk. Hence. firms are less likely to use RPE when their CEOs 
display greater self-hedging ability. Following Garvey and Milbourn (2003), we use the value of 
CEO equity holdings (CEO, Wealth) and CEO age (СЕО Age) to proxy for CEOs’ self-hedging 
ability. 


Ownership Structure and Board Attributes 


Bertrand and Mullainathan (2001) document that CEO pay is positively associated with 
exogenous shocks and that such “pay-for-luck” is less pronounced in better-governed firms. 
Garvey and Milbourn (2006) further show that executive pay is more sensitive to good luck than 
to bad luck and that this asymmetry is more pronounced for firms with weaker governance. Since 
RPE can limit the pay-for-luck practice and also has risk-sharing benefit, we expect that RPE use 
is positively associated with the quality of corporate governance. We measure governance quality 
using institutional ownership concentration (Top5 Instown), activist institutional ownership (Ac- 
tivist_Instown) as defined in Cremers and Nair (2005), CEO/Chair duality (CEO/Chair), the pro- 
portion of outside directors on the board (Board Independence), and the number of directors 
sitting on the board (Board, Size). 
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Compensation Consultant 

Compensation consultants have access to proprietary information about industry-wide com- 
pensation practices and potential competitors (Cadman et al. 2010; Murphy and Sandino 2010). 
Since the design and implementation of RPE contracts demand specialized knowledge and exper- 
tise, the availability of professional advice from compensation consultants is likely to influence 
firms’ decisions to use RPE. We construct an indicator variable to identify firms that use compen- 
sation consultants in setting executive pay (CompConsultant), based on firms’ first proxy disclo- 
sures under the SEC’s new disclosure rules. 


Results on RPE Use. 


To examine firms’ decisions to use RPE in executive compensation contracts, we estimate the 
following multivariate logistic regression: 


Prob(RPE, = 1) = Ф( + a@,Common_Risk,_; + @2|Size_Rkadj],_, + a3|Diversity_Rkadj\,_ 
+ aa|Return, Rkadj|,., + asIndustry_Concentration,.., + асВМ, | 
+ ие, 1 + agROA, Indadj, , + asReturn Indadj,., + ао, СЕО Wealth, , 
+ а СЕО Аре, + @,2,Top5_Instown,_; + aj34ctivist Instown, 4 
+ a@4CEO/Chair,_, + ац Воага Independence, 4 + @gBoard_Size,_, 
+ @7CompConsultant, + е). (2) 


The dependent variable, RPE, is an indicator variable that equals 1 for RPE firms and 0 for 
non-RPE firms. The independent variables include firm characteristics, CEO attributes, and cor- 
porate governance factors that potentially influence the use of RPE. 

Table 4 reports the results from estimating Equation (2). We find that firms exposed to greater 
common risk are more likely to use RPE, in line with the notion that RPE is more effective at 
removing exogenous shocks in performance evaluation when the firm shares greater common risk 
with its peers (Janakiraman et al. 1992). In contrast to the prediction that more intense product 
market competition (lower industry concentration) discourages the use of RPE (Aggarwal and 
Samwick 1999), we find that firms from less concentrated industries are more likely to implement 
RPE in compensation plans. This result, however, is consistent with the evidence in DeFond and 
Park (1999) that a more competitive environment is characterized by a higher degree of common 
risk and, hence, brings greater benefits to those who use RPE.'° We also find that firms exhibiting 
fewer growth opportunities (higher book-to-market) are more likely to use RPE, which contradicts 
Murphy’s (2001) prediction but is in line with the argument in Albuquerque (2007) that RPE is 
less beneficial for firms with high-growth options, since peer performance is a less informative 
signal of external shocks for these firms. 6 

Furthermore, we find that larger firms are more likely to use RPE, consistent with firm size 
serving as a proxy for public scrutiny and shareholder concerns about executive pay. We also find 
that firms are less likely to use RPE when the value of CEO equity holdings is large, consistent 


5 DeFond and Park (1999) examine RPE use in management turnover decisions and find stronger evidence of RPE use in 
highly competitive industries than in less competitive industries. Our finding that Industry_Concentration (inverse proxy 
for industry competition) is negatively related to RPE use, even after controlling for Common, Risk, suggests that 
industry concentration may capture common risk above and beyond Common_Risk. 

16 Albuquerque (2007) argues that growth options affect а firm’s common risk exposure and, hence, the informativeness of 
peer performance about the firm’s common risk. Given we have controlled for Common_Risk, the positive relation 
between book-to-market and RPE use suggests that book-to-market may capture common risk above and beyond 
Common_Risk. 
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TABLE 4 
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Logistic Regression of Relative Performance Evaluation Use in Executive Compensation 


Independent Variables 
Firm Characteristics 
Common_Risk 
\Size_Rkadjl 
lDiversity, Rkad] 
IReturn, Rkad] 
Industry, Concentration 
BM 
Size 
ROA_Indadj 
Return_Indadj 


CEO Attributes 
CEO Wealth 


СЕО Age 


Corporate Governance 
Top5 Instown 


Activist, Instown 
CEO/Chair 
Board_Independence 
Board_Size 
CompConsultant 


Intercept 


Number of RPE/Non-RPE firms 


Percent concordant/discordant 
Pseudo R? 

Wald y? 

(p-value) 
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Contracts 


Predicted 
Sign 


+= 


Coefficient 


(p-value) 


255/640 
76.9/22.9 
0.255 
118.071 
(<0.001) 


Change in Probability 


О1 versus Q3 Values 


0.088 
—0.049 
0.039 
0.010 
—0.097 
0.118 
0.026 
—0.009 


—0.001 


—0.051 


0.014 


—0.010 
0.011 
0.048 
0.011 
0.071 


0.081 
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TABLE 4 (continued) 


The sample includes 895 S&P 1500 firms listed in the Compustat annual file for 2006 that have available information on 
relative performance evaluation use in executive compensation contracts and available information on regression variables. 
The dependent variable is RPE, which is equal to 1 if the firm explicitly reported RPE use in executive compensation plans 
in its first annual proxy filed under the SEC’s 2006 executive compensation disclosure rules, and 0 otherwise. See Table 2 
for other variable definitions. Firm characteristics and CEO attributes (except Industry_Concentration and CEO_Age) are 
winsorized at the top and bottom one percentiles. The “Change in Probability” column shows the change in the probability 
of using RPE as the result of moving from the first to the third quartile value of the variable of interest (for indicator 
variables, moving from 0 to 1), holding all other variables constant at their mean values (for indicator variables, the 
benchmark probability is determined with the 0 value). The reported p-values in parentheses are two-tailed. 


with Garvey and Milbourn (2003). Finally, firms with more independent boards are more likely to 
use RPE, consistent with stronger internal governance favoring the use of RPE, and firms that hire 

” compensation consultants are also more likely to use RPE, suggesting that compensation consult- 
ants’ specialized knowledge and expertise facilitate the use of RPE. Among all determinants, 
growth opportunities, common risk exposure, industry concentration, compensation consultants, 
and board size have a relatively large impact on RPE use." 

In summary, our evidence confirms the importance of common risk and CEOs’ self-hedging 
ability in firms' decisions to use RPE, and it shows that firms with more independent boards and 
firms employing compensation consultants are more likely to use RPE. However, some of our 
results contradict prior evidence based on implicit tests (such as the effect of industry competi- 
tion). We also find little evidence that the availability of similar peers affects firms' decision to use 
RPE. Taken together, the evidence suggests that firms consider both costs and benefits when 
deciding to use RPE in executive compensation contracts. A potential caveat is that firms could 
have adopted RPE contracts prior to the new disclosure regulation, and, hence, our causal infer- 
ences should be interpreted with caution. 

We acknowledge that firms can use RPE implicitly through boards’ subjective discretion, 
rather than pre-committing to a formulaic explicit RPE contract (Ferri 2009). We also note that the 
explanatory variables in Equation (2) likely capture factors that influence a firm's decision to use 
explicit versus implicit RPE. From the efficient-contracting perspective, when RPE can bring 
greater net benefits (e.g., higher common risk exposure), it is likely optimal for the principal and 
the agent to pre-commit to an explicit RPE contract. From the rent-seeking perspective, to the 
extent that more flexibility in the pay-setting process allows managers to reap greater private 
benefits, firms with weaker corporate governance (e.g., less independent boards) are less likely to 
commit to an explicit RPE contract. Furthermore, firms facing intense public scrutiny for trans- 
parent compensation disclosures (e.g., larger firms) are more likely to explicitly commit to RPE to 
cater to shareholder preference by reducing the extent of board discretion. We refrain, however, 
from drawing inferences on explicit versus implicit RPE use, as a thorough investigation of this 
topic is beyond the scope of our study. 


17 Firms may use RPE to determine anywhere from a substantial portion to a minimal amount of executive pay. While it 
is difficult to ascertain the amount of executive pay that directly results from RPE use, we develop a rough proxy for the 
extent of RPE use (RPE prop) based on the ratio of RPE-involved compensation (e.g., bonus and long-term incentive 
payouts, restricted stocks, or stock options) to total compensation for CEOs. We exclude RPE firms with RPE prop less 
than 10 percent, and re-estimate Equation (2). The untabulated results are qualitatively similar as those reported. We also 
estimate Equation (2) using a Tobit regression, where we replace RPE with RPE prop, and we obtain qualitatively 
similar results. 
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VI. SELECTION OF RPE PEERS 
The overall benefits of an RPE contract depend critically on how efficiently the RPE peer 
group is зејестед 19 In this section, we first discuss determinants of RPE peer selection. Next, we 
compare characteristics between firms that are chosen to be in the RPE peer group and those that 
are not chosen. We then conduct a multivariate regression analysis of the peer-selection process. 


Determinants of RPE Peer Selection 


Common Risk Exposure 

We measure potential peers’ common risk exposure based on the similarity in operational 
strategies and industry environment, proxied by same industry membership (Same_SIC2 and 
Same_SIC3), the similarity in the firms’ scale and thus flexibility in responding to external shocks, 
proxied by S&P 1500 membership (5Р1500) and S&P sub-index membership (Same SP), and 
performance comovement measured by the correlation in stock returns between a potential peer 
and the RPE firm of interest (CORR(PeerReturn, RPEReturn)). 


Similar Ability 

To capture similarity in ability, we construct |PeerMVE КРЕМҮЕ! and |PeerReturn_RPERe- 
шиті, defined as the absolute differences in market value of equity and annual stock returns, 
respectively, between a potential peer and the RPE firm of interest. 


Self-Serving Bias 

Since firms usually select peer groups used in compensation contracts at the beginning of the 
fiscal year (e.g., Cadman and Carter 2010; Faulkender and Yang 2010), we measure self-serving 
bias using potential peers’ expected performance at the beginning of 2006. Specifically, we con- 
struct PeerReturn_IndReturn, defined as a potential peer's expected stock performance minus the 
median expected stock performance for the RPE firm's two-digit SIC industry (excluding the RPE 
firm of interest). We measure expected stock performance as analysts' first consensus 12-month 
target price forecast (issued within 90 days after the year beginning) minus the prevailing stock 
price, scaled by the prevailing stock price. 


Symbolism 

To test for symbolism considerations in the peer-selection process, we construct PeerSales- 
_IndSales and PeerMVE IndMVE, defined as the differences in sales and market value of equity, 
respectively, between a potential peer and the median levels for the RPE firm's two-digit SIC 
industry (excluding the RPE firm of interest). 


Univariate Results 


To get a preliminary assessment of which firms are chosen to be members of a RPE peer 
group, for each RPE firm that discloses self-selected peers, we identify all domestic firms listed on 
Compustat that are not selected as RPE peers. Recall that about 74 percent of RPE firms use 
stock returns as their performance metric when implementing RPE contracts (see Table 1, Panel 
C). These firms are likely to use stock returns as the basis for selecting peer firms (in addition to 





18 We do not examine the peer-selection process of RPE firms that choose published market or industry indices as RPE 
peer groups (see Table 1, Panel E) because it is not feasible to test the selection of published indices due to difficulty in 
gathering the performance and composition information for a population of indices. We also do not examine firms' 
selection between published indices versus self-selected peer groups due to lack of related economic theories. 

19 Some of our sample firms select foreign firms in forming their RPE peer groups. We do not include foreign firms listed 

_ on Compustat, because Compustat only includes foreign firms cross-listed in the United States and thus does not provide 
an unbiased sample for foreign firms. In addition, Compustat collects firms' reported accounting numbers, while cross- 
listed firms may report financial statements using non-U.S. accounting standards. 


The Accounting Review | Мау 2011 
American Accounting Association 


Relative Performance Evaluation and Related Peer Groups 1029 


other considerations). To more accurately capture firms’ peer-selection process, we further limit 
this analysis to the 135 RPE firms that employ self-selected peer groups and use stock returns as 
the RPE-based performance metric.” 

Table 5 provides descriptive statistics on selected peers and unselected peers. As shown in 
Panel A, selected peers are more likely to operate in the same industry as the RPE firms, and are 
more likely to belong to the S&P 1500 index and the same S&P sub-index as the RPE firms. 
Furthermore, performance comovement between potential peers and the RPE firms is much higher 
for selected peers than for unselected peers. These results suggest that selected peers share com- 
mon shocks with the RPE firms to a greater extent than do unselected peers. Consistent with firms 
choosing peers with similar ability, we find that selected peers have closer size and stock perfor- 
mance to RPE firms than unselected peers. Panel A also reveals a potential peer-selection bias, in 
that selected peers have significantly lower expected stock performance than unselected peers. 
Finally, consistent with symbolism in setting CEO pay, we find that selected peers have larger 
sales and larger market capitalization than unselected peers. 

Table 5, Panel B compares selected and unselected peers conditional on whether peers are in 
the RPE firm’s industry. One intriguing finding is that unselected peers from RPE firms’ industry 
(within-industry peers) display higher performance comovement than selected peers outside RPE 
firms’ industry (outside-industry peers). Hence, cross-industry peer selection does not seem to 
improve performance comovement. Nevertheless, selected outside-industry peers have larger sales 
and market capitalizations than selected within-industry peers, consistent with RPE firms selecting 
larger firms outside their own industries for symbolism considerations. 


Multivariate Regression Results 


To further examine the selection of RPE peers, we estimate the following logistic regression 
for the 135 RPE firms that disclose self-selected peer groups and use stock returns as the RPE- 
based performance metric, with standard errors clustered by RPE firm: 


Prob(RPE. Peer, = 1) = (a + aSame_SIC2jj,_) + ахбате, SIC3,, |  o5SP1500;, 1 
+ a4Same  SP,y , + а; СОКК(РеетКешт,КРЕКешт) jj, 
+ a@|PeerMVE_RPEMVE| iji + @,|PeerReturn_RPEReturn| 1-1 
+ agPeerReturn_IndReturnyy,_ + agPeerSales. IndSales;, у 
+ @oPeerMVE_IndMVE),_\ + ец). (3) 


In Equation (3), the dependent variable, RPE Peer;;, is an indicator variable that equals 1 if 
a potential peer firm j (i.e., a domestic firm on Compustat) is chosen to be a member of the RPE 
peer group by RPE firm i for year t, and 0 otherwise. б 

Given that the relative proportion of selected versus unselected RPE peers is highly unbal- 
anced, the estimated coefficients from logistic regressions could be significantly biased and inef- 
ficient (Owen 2007). To alleviate this concern, we follow the commonly used undersampling 
method to randomly remove unselected peers when estimating logistic regressions (e.g., 
Chawla et al. 2003). Specifically, for each RPE firm, we randomly select a set of peers from 


20 The untabulated results based on all RPE firms with self-selected RPE peers are qualitatively similar to those reported 
in Tables 5 and 6. We do not conduct similar tests on RPE firms using accounting performance metrics because the 
heterogeneity in RPE-based accounting performance metrics does not allow a sufficiently large sample size to examine 
any individual accounting performance metric. Nevertheless, if we use earnings as a general proxy for RPE-based 
accounting performance metrics, we obtain qualitatively similar inferences regarding the RPE peer selection for firms 
using accounting performance metrics in RPE contracts. | 
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TABLE 6 


Logistic Regression of Peer Firm Selection for S&P 1500 Firms Using Self-Selected Peers 
in Relative Performance Evaluation and Stock Returns as the Performance Metric 


Panel A: All RPE Peers 


Predicted Coefficient Change in Probability 
Independent Variables Sign (p-value) О1 versus Q3 Values 
Common Risk Exposure 
Same_SIC2 + 2.394 0.416 
(<0.001) 
Same_SIC3 + 1.005 0.232 
(<0.001) 
SP1500 + 0.525 0.128 
(<0.001) 
Same_SP + 0.940 0.219 
(<0.001) 
CORR(PeerReturn, RPEReturn) + 0.246 0.049 
(0.001) 
Similar Ability 
ЇРеегМҮЕ, RPEMVH - -0.013 -0.033 
(<0.001) 
IPeerReturn, RPEReturnl = – 0.178 – 0.014 
(0.197) 
Self-Serving Bias 
PeerReturn IndReturn = —0.921 —0.038 
(<0.001) 
Symbolism 
PeerSales_IndSales + 0.019 0.024 
(0.016) 
PeerMVE_IndMVE + 0.017 0.028 
(<0.001) 
Intercept -1.416 
(<0.001) 
Number of selected/unselected peers 1,668/1,668 
Percent concordant/discordant 92.4/7.5 
Pseudo R? 0.668 
Wald y? 895.429 
(p-value) (<0.001) 


Panel B: Conditional on RPE Firms’ Stock Performance 
Underperforming RPE Firms Outperforming RPE Firms 





Change in 
Change in Probability Probability 
Predicted Coefficient О1 versus Q3 Coefficient Q1 versus Q3 
Independent Variables Sign (p-value) Values (p-value) Values 
Common Risk Exposure 
Same, SIC2 | + 2.841 0.445 2.068 0.388 
(<0.001) (<0.001) 
(continued on next page) 
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Panel B: Conditional on RPE Firms’ Stock Performance 


Independent Variables 
Same_SIC3 


SP1500 
Same_SP 


CORR(PeerReturn, 
RPEReturn) 


Similar Ability 
IPeerMVE RPEMVEI 


\PeerReturn_RPEReturnl 


Self-Serving Bias 
PeerReturn_IndReturn 


Symbolism 
PeerSales_IndSales 


PeerMVE_IndMVE 


Intercept 


Number of selected/ 
unselected peers 


Percent concordant/. 
discordant 

' Pseudo R? 

Wald x” 

(p-value) 


+ 


792/192 


93.1/6.7 
0.690 
436.588 
<0.001 


0.027 


0.028 
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Underperforming RPE Firms Outperforming RPE Firms 


Panel C: RPE Peers Not Used For Compensation Benchmarking Purposes versus 
Compensation-Benchmarking Peers Not Used for RPE 


Independent Variables 


Common Risk Exposure 
Same_SIC2 


The Accounting Review 


RPE Peers Not Used for 
Pay Ben 
Change in 
Probability 


Coefficient (1 versus 
(p-value) Q3 Values 


2.566 0.429 
(«0.001) 


in 
Probability 
Coefficient Q1 versus Q3 
(p-value) Values 
1.362 0.296 
(<0.001) 
1.094 0.249 
(<0.001) 
0.777 0.185 
(<0.001) 
0.267 0.057 
(0.004) 
—0.016 -0.034 
(0.069) 
-0.312 -0.026 
(0.132) 
—0.588 -0.026 
(0.018) 
0.020 0.023 
(0.012) 
0.019 0.029 
(0.008) 
—2.058 
(<0.001) 
876/876 
92.1/7.8 
0.665 
446.964 
<0.001 
Benchmarking Peers Not 
Used for RPE 
Change in 
Probability 
Coefficient QI versus Q3 
(p-value) Values 
2.104 0.391 
(<0.001) 


(continued on next page) 
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Panel C: RPE Peers Not Used For Compensation Benchmarking Purposes versus 
Compensation-Benchmarking Peers Not Used for RPE Purpose 


RPE Peers Not Used for Benchmarking Peers Not 
Pay Benchmarking Used for RPE 
Change in Change in 
Probability Probability 
Coefficient О1 versus Coefficient Q1 versus Q3 
Independent Variables (p-value) Q3 Values (p-value) Values 
Same, SIC3 2.044 0.385 0.545 0.133 
(<0.001) (0.119) 
SP1500 0.443 0.109 0.702 0.169 
(0.043) (<0.001) 
Same SP 0.852 0.201 0.804 0.191 
(<0.001) (<0.001) 
CORR(PeerReturn, RPEReturn) 0.259 0.051 —0.521 -0.103 
(0.068) (50,001) 
Similar Ability: 
|Рее'МУЕ_КРЕМУЕ – 0.021 —0.052 —0.009 -0.028 
(0.011) (0.002) 
ЈРеетКешт_КРЕКешт! – 0.306 -0.024 0.570 0.045 
(0.193) (0.004) 
Self-Serving Bias 
PeerReturn_IndReturn —1.236 —0.049 1.338 0.056 
(«0.001) («0.001) 
Symbolism 
PeerSales_IndSales 0.007 0.010 0.003 0.005 
(0.610) (0.688) 
PeerMVE_IndMVE 0.022 0.036 0.013 0.031 
(0.027) (<0.001) 
Intercept —1.349 –1.751 
(«0.001) (<0.001) 
Number of selected/unselected peers 511/511 72/472 
Percent concordant/discordant 89.3/10.6 90.4/9.5 
Pseudo RÊ 0.578 0.608 
Wald х? 228.344 229.045 
(p-value) <0.001 <0.001 





The sample includes selected peers and unselected peers (generated from random sampling) for the 135 RPE firms 
choosing self-selected peer groups and using stock returns as the performance metric in relative performance evaluation. 
The dependent variable is RPE Peer, which is equal to 1 if the firm (one Compustat domestic firm) is chosen as a peer for 
relative compensation evaluation purposes, and 0 otherwise. See Table 5 for the other variable definitions. “Underperform- 
ing (Outperforming) RPE Firms" include RPE firms with below-industry (above-industry) stock retums during the year 
prior to the SEC’s new executive compensation disclosure rules. ЇРевгМҮЕ КРЕМҮЕ and iPeerReturn RPEReturnl are 
winsorized at the top one percentiles. PeerReturn_IndReturn, PeerSales_IndSales, and PeerMVE_IndMVE are winsorized 
at top and bottom one percentiles. For ease of exposition, coefficients on PeerSales IndSales, PeerMVE_IndMVE, and 
iPeerMVE RPEMVEH are multiplied by 1,000. The "Change in Probability" columns show the change in the probability of 
being selected as a RPE peer as the result of moving from the first to the third quartile value of the variable of interest (for 
indicator variables, moving from 0 to 1), holding all other variables constant at their mean values (for indicator variables, 
the benchmark probability is determined with the O value). Two-tailed p-values in parentheses are based on clustered 
standard errors at the RPE firm level. The reported p-values in parentheses are two-tailed. 


domestic firms on Compustat that are not chosen by the RPE firm. For each RPE firm, we require 
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that the number of unselected peers in the random sample be equal to the number of selected 
peers. As reported in Table 5, Panel A, unselected peers generated from random sampling display 
similar characteristics to those of the population of unselected peers. To ensure the robustness of 
our results, we generate 100 random samples for each regression reported in Table 6, and find 
consistent results across each random generation.?! 

Table 6, Panel A reports results from estimating Equation (3). Consistent with the efficient- 
contracting view, firms that belong to the same industry as the RPE firm, firms included in the 
S&P 1500 index and from the same S&P sub-index as the RPE firm, and firms with higher return 
comovement are more likely to be chosen as RPE peers. We also find some evidence that similar 
firms are more likely to be selected as RPE peers—firms with closer size to the RPE firm are more 
likely to serve as RPE peers. With respect to selection bias, we find statistically significant evi- 
dence that firms with lower expected stock performance are more likely to be chosen as RPE 
peers. In line with the symbolism considerations, firms with larger sales and larger market capi- 
talization than the RPE firm's industry median are more likely to be chosen as RPE peers. 

Among all the factors affecting the peer-selection process, industry membership seems to 
have the greatest economic significance. For instance, firms from the same two-digit SIC industry 
as the RPE firm are about 42 percent more likely to be selected as RPE peers than firms are from 
different industries. S&P index membership also has substantial influence over the likelihood of 
being selected as а RPE peer. Performance comovement, size similarity, self-serving bias, and 
symbolism variables have smaller, but still significant, economic effects on RPE peer selection. 
While the peer-selection bias seems to have small economic significance, we are uncertain about 
the ultimate impact of peer-selection bias on executive compensation. Taken together, our findings 
support both efficient-contracting and rent-seeking behavior in the RPE peer-selection process. 

We next examine the potential influence of RPE firms' performance on the relative impor- 
tance of efficient contracting versus rent extraction in peer selection. We posit that underperform- 
ing managers are more concerned about job security and pay reduction, and, hence, have stronger 
incentives to manipulate the peer-selection process. To examine this issue, we re-estimate Equa- 
tion (3) among RPE firms with below industry median versus above industry median stock returns 
during the year prior to the SEC's new disclosure regulation. The results are reported in Table 6, 
Panel B. Consistent with our conjecture, we find that the negative relation between potential peers' 
expected performance and the likelihood of being selected as a RPE peer is more (less) pro- 
nounced for RPE firms that underperform (outperform) their industries. Moreover, the positive 
impact of stock performance comovement over RPE peer selection is significant among outper- 
forming RPE firms but insignificant among underperforming RPE firms. These findings suggest 
that superior performance leads efficient-contracting considerations to dominate in the RPE peer- 
selection process, probably due to managers worrying less about job security and pay reduction. In 
contrast, peer selection by underperforming firms appears to reflect rent-seeking behavior, prob- 
ably because managers are concerned about reputation loss and pay reduction and thus have 
stronger incentives to manipulate the peer-selection process. 


21 For instance, with respect to results reported in Table 6, Panel A, across the 100 random-sample regressions, the 
coefficients on Same 5122, Same 5ІС3, SP1500, Same, SP, IPeerMVE, RPEMVEI, PeerReturn IndReturn, and Peer- 
МУЕ  IndMVE are statistically significant as predicted for all 100 regressions. The coefficient on PeerSales_IndSales is 
statistically significant as predicted for 95 regressions. The coefficient on CORR(PeerReturn, RPEReturn) is statistically 
significant as predicted for 93 regressions. Finally, the coefficient on lPeerReturn, RPEReturnl is statistically significant 
as predicted for 44 regressions. 

22 Given that selected peers have larger sales and larger market capitalization than unselected peers, our findings may be 
confounded by size differences between selected and unselected peers. To mitigate this concern, we restrict the peer 
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RPE Peer Selection versus Compensation-Benchmarking Peer Selection 


Many firms use compensation-benchmarking peers in setting the level of executive compen- 
sation, for the purpose of gauging an executive’s reservation wage in the competitive labor 
market.” Recent studies suggest that firms select highly paid peers to justify excess CEO com- 
pensation (Bizjak et al. 2010; Faulkender and Yang 2010). RPE peers and benchmarking peers, 
however, not only differ in their theoretical justification, but also in the direction of peer-selection 
bias for peer performance: better performing firms tend to pay higher compensation and thus are 
more likely to be selected as compensation-benchmarking peers, whereas worse-performing firms 
are more likely to be selected as RPE peers to boost firms’ relative performance and, hence, 
incentive compensation. 

To shed some light on the peer selection for the purpose of RPE versus the purpose of pay 
benchmarking, we collect compensation-benchmarking peers for the 232 firms that use self- 
selected RPE peers in 2006. We find that 147 firms use two different self-selected peer groups” 
(one for RPE purposes and the other for pay-benchmarking purposes), with a mean (median) 
overlapping rate of 72 percent (81 percent), where the overlapping rate is defined as the number of 
common peers between the two peer groups divided by the size of the RPE peer group. 

We then re-estimate Equation (3) by limiting the sample to selected RPE peers not included in 
the compensation-benchmarking peer group, together with randomly sampled peers. Similarly, we 
re-estimate Equation (3) by limiting the analysis to selected pay-benchmarking peers not used for 
RPE purposes, together with randomly sampled peers. To be consistent with analyses above, we 
further require firms to use stock returns as the RPE-based performance metric. Table 6, Panel C 
reports the regression results. For the selection of RPE peers not used for pay benchmarking, we 
find qualitatively similar results to those reported in Panel A. Interestingly, we observe that per- 
formance comovement negatively influences the selection of pay-benchmarking peers. Moreover, 
peers’ expected stock performance has a positive impact on the likelihood of being chosen as a 
pay-benchmarking peer, consistent with firms favoring better-performing peers to boost executive 
pay. These results stand in contrast to the RPE peer-selection results, which support the notion that 
RPE peers and pay-benchmarking peers serve different purposes and their selection reflects dif- 
ferent considerations. 


Intertemporal Changes in RPE Peer Group Composition 


To provide further evidence on RPE peer selection, we analyze intertemporal changes in RPE 
peer-group composition. If peer selection conforms to efficient contracting, then we expect added 
peers to exhibit greater ability to remove common risk and greater similarity with the RPE firm 


population to S&P 1500 firms, and re-estimate Equation (3). This sample restriction ignores the fact that about 30 
percent of selected RPE peers do not belong to the S&P 1500 index (see Table 1, Panel F) which potentially lowers the 
test power. Nevertheless, we find qualitatively similar results as those reported in Table 6, Panel A (untabulated). We 
also find qualitatively similar results as those reported in Table 6, Panel B. In particular, the coefficient on PeerReturn- 
_IndReturn is significantly negative for underperforming RPE firms, but insignificantly positive for outperforming RPE 
firms (untabulated). 

23 Consistent with the efficient use of benchmarking peers, recent studies show that benchmarking peers represent а set of 
companies against which a firm competes for executive talent (Bizjak et al. 2008, 2010; Cadman and Carter 2010; 
Albuquerque et al. 2009). 

24 Starting with 232 firms, we are able to find 228 firms’ pay-benchmarking peer groups from reading CD&A reports. For 
these 228 firms, we find that 20 firms use either a broader definition of peers (such as an index) or proprietary 
compensation survey data for pay-benchmarking purposes. For the 208 firms that use self-selected pay-benchmarking 
peers, the mean (median) size of pay-benchmarking peer groups is 17 (15) peers. Of these 208 firms, 36 firms use the 
same peer groups for both pay-benchmarking and RPE purposes; 25 firms have RPE peer groups that are a subset of 
their pay-benchmarking peer groups; 3 firms use completely different peer groups for two purposes; and the remaining 
144 firms have some overlap between the two peer groups. 
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than dropped peers. If, on the other hand, firms choose poorly performing peers when forming 
RPE peer groups, then we expect added peers to exhibit worse performance than dropped peers. 

To identify the sample, we start with the 232 firms that disclose self-selected RPE peers in 
2006. We find that 180 firms continue using RPE in 2007, among which 155 firms disclose 
self-selected RPE peer groups. Of these 155 firms, 22 firms use the same RPE peer groups and 133 
firms modify their RPE peer groups in 2007, with 74 percent of 2006 RPE peers retained in 2007. 
The final sample includes 75 firms that both dropped and added RPE peers in 2007 and have 
available data on required variables. 

Table 7 reports differences in the factors affecting the selection of RPE peers between added 
and dropped peers. We find that added peers are closer in size to RPE firms than dropped peers, 
consistent with more (less) similar firms being added to (dropped from) the RPE peer group. 
Moreover, added peers’ industry-adjusted expected performance is significantly negative, whereas 
dropped peers’ industry-adjusted expected performance is insignificant. While this finding sup- 
ports self-serving bias in RPE peer selection, the contrast between added and dropped peers is 
statistically insignificant. This weak evidence could reflect low test power due to stable peer-group 
composition and/or increased public scrutiny of the peer-selection process after the SEC’s new 
regulation prompting firms to restrain opportunistic behavior in peer selection. 


ҮН. CONCLUSION 
We examine the explicit use of relative performance evaluation (RPE) and related peer groups 
based on S&P 1500 firms’ first proxy disclosures under the SEC’s 2006 executive compensation 


TABLE 7 


Mean (Median) of Variables Used to Explain RPE Peer Selection for Peers Added, 
' Dropped, and Unchanged in 2007 


Paired 
Peers Added Peers Dropped Peers Unchanged t-statistic 
a, E 55 
Common Risk Exposure 
Same 5ІС2 0.614 0.653 0.703 —1.63 
Same_SIC3 0.481 0.465 0.552 1.27 
5Р1500 0.720 0.707 0.789 0.15 
Same_SP 0.590 0.507 0.643 1.49 
CORR(PeerReturn, 0.357 0.424 0.394 -1.54 
RPEReturn) 
Similar Ability 
IPeerMVE КРЕМҮЕ 15,428.350 21,772.750 15,703.120 —1.68* 
\PeerReturn_RPEReturn| 0.232 0.253 0.222 -0.89 
Self-Serving Bias 
PeerReturn_IndRetu -0.035 -0.014 -0.038 —1.55 
Symbolism 
PeerSales_IndSales 10,252.390 14,182.150 11,710.980 —0.70 
PeerMVE_IndMVE 18,447.670 22,862.210 19,342.390 —0.52 


* Indicates significance at less than the 10 percent level based on two-tailed paired t-tests of the mean differences. 
Bold figures indicate significance level at less than 5 percent. 


The sample includes 75 firms that report two different RPE peer groups for 2006 and 2007 and have available data on 
required variables. All variables are measured at the beginning of 2007. PeerReturn is peer firms’ expected stock perfor- 
mance (based on analysts’ target price forecasts at the beginning of 2007). See Table 5 for the other variable definitions. 
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disclosure rules. We demonstrate that a lack of knowledge of both RPE peer-group composition 
and the link between RPE-based performance targets and future peer-performance cloud infer- 
ences drawn from implicit tests. These findings highlight the limitations of the implicit approach 
and underscore the importance of incorporating explicit RPE contract details in testing for RPE 
use. We also find that firms consider both costs and benefits of RPE as an incentive mechanism 
when deciding to use RPE. Analyses of self-selected RPE peers lend support to the efficient use of 
RPE as well as a self-serving bias in forming RPE peer groups.” In particular, RPE firms’ superior 
performance leads efficient-contracting considerations to drive the selection of RPE peers, 
whereas underperforming RPE firms exhibit greater rent-seeking behavior in the RPE peer- 
selection process. 

One limitation of our analyses is that we are unable to identify firms that use RPE implicitly 
without pre-committing to explicit RPE contracts. Future research could examine both explicit and 
implicit contracting to better understand RPE as an incentive device. Furthermore, while our study 
sheds light on target-setting in executive compensation contracts, our analysis likely omits other 
factors that boards consider in setting performance targets for executives. The peer-selection bias 
we document, for example, could reflect boards’ conscious efforts to balance various factors in 
setting performance targets (Merchant and Manzoni 1989). Examining interactions among various 
factors would be an interesting avenue for future research. 


APPENDIX A 
TIMELINE OF INTRODUCING THE SEC’S NEW EXECUTIVE COMPENSATION 
DISCLOSURE RULES | 


October 16, 1992 to January 27, 2006: Executive Compensation Disclosure 
The 1992 regulation eschewed a mostly narrative disclosure approach adopted in 1983 in 
favor of formatted tables that capture all executives’ compensation. 


January 27, 2006: SEC released the proposed rule Executive Compensation and Related Party 
Disclosure 
The proposed rule combines a broader-based tabular presentation with improved narrative 
disclosure supplementing the tables. 


January 27, 2006 to April 10, 2006: Open for public comments 
SEC had received more than 20,000 comment letters, most of which supported the new 
disclosure reform. 


August 29, 2006: SEC released the final rule Executive Compensation Disclosure 
August 29, 2006 to October 23, 2006: Open for public comments 


Effective Date: 


Companies must comply with these disclosure requirements in Forms 8-K for triggering events that 
occur on or after November 7, 2006 and in Forms 10-K and 10-KSB for fiscal years ending on or 
after December 15, 2006. Companies other than registered investment companies must comply 


25 Other than RPE peer selection, there exist alternative channels through which firms can influence the achievability of 
RPE-based performance targets and, hence, executive pay (such as through raising or lowering the hurdle rate of 
RPE-based performance targets or selecting different RPE-based performance metrics). We acknowledge that our infer- 
ences concerning the efficiency or opportunism in the RPE peer selection do not necessarily imply optimal or excessive 
executive pay, because alternative design features of RPE contracts (such as the hurdle rate or performance metrics) 
might either introduce greater rent extraction or offset the selection bias in the RPE peer-selection process. 
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with these disclosure requirements in Securities Act registration statements and Exchange Act 
registration statements (including pre-effective and post-effective amendments), and in any proxy 
or information statements filed on or after December 15, 2006 that are required to include Item 402 
and 404 disclosure for fiscal years ending on or after December 15, 2006. Registered investment 
companies must comply with these disclosure requirements in initial registration statements and 
post-effective amendments that are annual updates to effective registration statements on Forms 
N-1A, N-2 (except those filed by business development companies) and N-3, and in any new proxy 
or information statements, filed with the Commission on or after December 15, 2006. (directly 
quoted from the SEC) 


APPENDIX B 
CODING CRITERIA AND EXAMPLES OF PROXY DISCLOSURES ABOUT 
RELATIVE PERFORMANCE EVALUATION (RPE) 
Identifying RPE Firms from Compensation Discussion and Analysis (CD&A) Reports 


If a firm states that at least one component of executive compensation (e.g., annual bonus, 
restricted stock, stock option, long-term incentive payout) is determined based on firm perfor- 
mance relative to a group of peers, then the firm is identified as a RPE firm; otherwise, the firm is 
identified as a non-RPE firm. To be conservative, we classify firms that comment favorably on the 
principle of RPE, but do not disclose details on the use of RPE in setting compensation, as 
non-RPE firms. 

Since the use of compensation-benchmarking peer groups is much more prevalent than the 
use of RPE peer groups in S&P 1500 firms, it is important to distinguish firms using compensation 
benchmarking from firms using RPE. The SEC defines compensation benchmarking as “bench- 
marking of total compensation, or any material element of compensation” (SEC Final Rule 
33-8732a, Item 402(b)(2)(xiv)). Consistent with this definition, we classify firms using peer 
groups to benchmark the level of executive compensation as pay-benchmarking firms. Conse- 
quently, if our reading of the CD&A report for a sample firm suggests that the firm uses peer 
groups to solely benchmark the /evel of pay without using peer groups to determine the perfor- 
mance target when evaluating the firm's performance, then the firm is coded as a non-RPE firm. 

Below are excerpts from the CD&A reports in the first proxy statements filed after December 
16, 2006 for four companies. The first three companies are classified as RPE firms. The last 
company is classified as a non-RPE firm due to lack of details about RPE use in compensation 
contracts. Discussions about RPE use are bolded. 


General Electric Co 


Performance share units (PSUs). Since 2003, we have compensated the CEO with PSUs in lieu of 
any other equity incentive compensation... The receipt of shares underlying PSUs is determined 
entirely by the performance of the company against two key metrics: an internal metric that 
measures the cash-producing capability of the company and an external metric that measures the 
performance of the company against a broad market index... PSUs will convert into shares of 
GE stock at the end of the five-year performance period only if the specified performance objec- 
tives have been achieved. Half of the PSUs will convert into shares of GE stock only if GE's cash 
flow from operating activities, adjusted to exclude the effect of unusual events, has grown an 
average of 10 percent or more per year over the five-year performance period. Otherwise, they will 
be cancelled. The remaining PSUs will convert into shares of GE stock only if GE's total 
shareowner return meets or exceeds that of the S&P 500 over the flve-year performance 
period. Otherwise, they will be cancelled. 
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PG&E Corp 


In establishing levels of executive compensation, each year the Committee reviews the appropri- 
ateness of the comparator groups used to assess the competitiveness of PG&E corporation’s com- 
pensation programs (Pay Comparator Group) and PG&E Corporation’s corporate performance · 
(Performance Comparator Group)... The primary comparator group used for purposes of setting 
2006 officer compensation consists of all companies listed in the Dow Jones Utility Index and the 
Standard & Poor’s Electrics Index, and all California investor-owned utilities (Фе “Pay Comparator 
Group”)... This group of companies is broad enough to provide statistical validity and data avail- 
ability, represents the segment of the market where PG&E Corporation and Pacific Gas and Electric 
Company recruit officers with industry-specific experience, and is determined on an objective and 
transparent basis. For purposes of corporate performance comparisons (including the relative 
total shareholder return measured for the 2006-2008 performance share award cycle), the 
Committee uses a subgroup of 12 companies that have similar characteristics and business 
models as PG&E Corporation (the “Performance Comparator Group"): ‘Ameren Corpora- 
tion, American Electric Power, CenterPoint Energy, Inc., Consolidated Edison, Entergy Cor- 
poration, FPL Group, NiSource Inc., Pinnacle West Capital, Progress Energy, Inc., Southern 
Company, TECO Energy, and Xcel Energy. This group of companies is a subset of the Pay 
Comparator Group and, like PG&E Corporation, is focused on core regulated-utility activi- ^ 
ties with either a distribution or an integrated-utility focus. 


Performance Shares 


The payment for performance shares will be in cash and will be calculated by multiplying (1) the 
number of vested performance shares, (2) the average closing price of PG&E Corporation common 
stock over the last 30 calendar days of the year preceding the vesting date, and (3) a payout factor 
based on corporate performance... There will be no payout for TSR (“total shareholder re- 
turn") performance below the 25th percentile of the Performance Comparator Group; there 
will be a 25 percent payout if TSR is at the 25th percentile; there will be a 100 percent payout 
if TSR is at the 75th percentile; and there will be a 200 percent payout if PG&E Corpora- 
tion's TSR ranks first in the Performance Comparator Group. If PG&E Corporation's TSR 
is between the 25th percentile and the 75th percentile, or above the 75th percentile, award 
payouts will be determined by straight-line interpolation, adjusted to round numbers (i.e., the 
nearest multiple of five). 


Cleco Corp 


Our Compensation Committee uses two comparator groups to design executive-officer compensa- | 
tion plans and track comparable performance of those plans. These groups are referred to as 
_ comparator group(s), peer group(s), peers or the competitive market throughout this discussion, The 
- Base Peer Group was selected based on the companies being of approximate size and scope to 
Cleco (after regression analysis for size differences), employing similar labor and talent pools and 
having their executive officer compensation data being available to the outside independent con- 
sultant who analyzes the market data for the Compensation Committee. The Compensation Com- 
mittee considers the availability of such detailed market data to be critical in making comparative 
compensation decisions... The Incentive Peer Group was selected by the Compensation Com- 
mittee in order to measure the actual performance results of our incentive plans. The Incen- 
tive Peer Group is based on the companies being part of a recognized stock market index, as 
well as being in the same general industry classification system. This helps ensure the Com- 
pensation Committee evaluates our actual incentive plan performance against a group of 
companies whose scope of operations and market capitalization is similar to Cleco's. Data 
from the peer groups are an integral part of the decision process used by the Compensation 
Committee in determining tbe design, component parts and levels of awards contained in our 
executive officer pay programs. | | 


The Accounting Review May 2011 
American Accounting Association 


Relative Performance Evaluation and Related Peer Groups | 1041 


Вазе Реег Стоир 


For 2006 and 2007, executive officer compensation levels were evaluated using the Base Peer 
Group. This includes base salary, annual and long-term incentive plan targets, other potential equity 
awards and total compensation. The Base Peer Group consists of companies that are generally 
either in the Edison Electric Institute (“БЕГ”) Index or the S&P Small and MidCap Electric Utilities 
Index. We are included in both indices. 


Incentive Peer Group 


For 2006, the relative actual performance of the financial measures used in our annual and long- 
term incentive plans was determined using the Incentive Peer Group. The Incentive Peer Group 
consisted of the following 15 companies contained in the S&P Small and MidCap Electric 
Utilities Index: Allete, Inc.; Central Vermont Public Service; DPL, Inc.; Duquesne Light 
Holdings, Inc.; E] Paso Electric Company; Great Plains Energy Inc.; Green Mountain Power 
Corporation; Hawaiian Electric Industries; IDACORP, Inc.; Northeast Utilities; Pepco Hold- 
ings, Inc.; Sierra Pacific Resources; UIL Holdings Corporation; UniSource Energy Corpora- 
tion; and Westar Energy, Inc. The same 15 companies will be used as the Incentive Peer Group 
in 2007. 


Consolidated Graphics Inc 


Overall Compensation Philosophy and Policies 


Our compensation philosophy regarding members of the Board and the Company’s Executive 
Officers is to maintain compensation policies which align compensation with the Company’s over- 
all business strategy, values and management initiatives. The policies are intended to (1) reward 
individuals for long-term strategic management and enhancement of shareholder value; (2) support 
a performance-oriented environment that rewards achievement of internal Company goals 
and recognizes the Company’s performance compared to the performance of similarly situ- 
ated companies; (3) attract and retain individuals whose abilities are considered essential to the 
long-term future and competitiveness of the Company; and (4) align the financial interests of the 
Company’s directors and Executive Officers with those of the shareholders. 
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ABSTRACT: The monetary unit assumption of financial accounting assumes a stable 
currency (.е., constant purchasing power over time). Yet, even during perlods of low 
inflation or deflation, nominal financial statements violate this assumption. ! posit that, 
while the effects of inflation are not recognized in nominal statements, such effects may 
have economic consequences. | find that unrecognized inflation gains and losses help 
predict future cash flows as these gains and losses turn into cash flows over Чте. | also 
find significant abnormal retums to inflation-based trading strategies, suggesting that 
stock prices do not fully reflect the implications of the inflation effects for future cash 
flows. Additional analysis reveals that stock prices act as If investors do not fully distin- 
guish monetary and nonmonetary assets, which is fundamental to determining the ef- 
fects of Inflation. Overall, this study is the first to show that, although inflation effects are 
not recognized In nominal financial statements, they have signiflcant economic conse- 
quences, even during a period In which inflation Is relatively low. 


Keywords: inflation; asset pricing; information; financial reporting; abnormal returns; 
cash flows; capital markets. 


Data Availability: Data are available from public sources indicated in the text. 


I. INTRODUCTION - 
he U.S. financial reporting regime is nominal, which assumes no changes to the purchasing 
| power of the dollar over time. That is, the accounting amounts reported in financial state- 
ments based on U.S. Generally Accepted Accounting Principles (GAAP) are called nomi- 
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nal because they are not adjusted for inflation.’ Yet, the annual U.S. inflation rate over the past two 
decades has averaged to 3 percent. When purchasing power is not constant, a nominal reporting 
system that combines monetary amounts from different periods violates the Monetary Unit ac- 
counting assumption of a stable currency (SFAC 5, Financial Accounting Standards Board [FASB] 
1984) and decreases comparability both across firms and over time. Although the effects of 
inflation on financial reporting received considerable attention during the late 1970s when inflation 
was relatively high, inflationary effects have been mostly ignored in the more modest inflationary 
environment since that time. 

Nominal financial statements, by their nature, do not account for gains and losses attributable 
to changes in purchasing power over time. For example, whereas the erosion of a firm's monetary 
assets (e.g., cash) attributable to inflation is a loss to the firm, the erosion of a firm's monetary 
liabilities (e.g., debt) is a gain. Further, whereas inflation-adjusted amounts of nonmonetary items 
(e.g., land) accumulate inflationary effects over time to reflect changes in purchasing power, such 
effects are not recognized in nominal financial statements. The difference between inflation- 
adjusted and nominal earnings represents unrecognized gains and losses from inflation. In this 
study, I investigate the economic consequences of omitting these inflation effects from nominal 
financial statements. Specifically, I test whether unrecognized gains and losses from inflation help 
predict future cash flows, and whether investors incorporate this information into their investment 
decisions. 

This study is most closely related to inflation accounting studies that investigate the implica- 
tions of inflation in the U.S. during the 1970s and 1980s (e.g., Beaver 1979; Easman et al. 1979; 
Beaver et al. 1980; Gheyara and Boatsman 1980; Ro 1980; Watts and Zimmerman 1980; Beaver 
and Landsman 1983). These studies address questions primarily related to whether the effects of 
inflation are associated with contemporaneous annual and short-window stock returns. A conclu- 
sion from these studies is that inflation-adjusted (or, interchangeably, ТА) data are inconsequential 
for making financial decisions.? 

I take a different approach by considering the possibility that inflation can have ойсо 
over a period longer than the contemporaneous year. In particular, if unrecognized inflation gains 
and losses (or simply inflation gains, for brevity) are realized over time, then they are likely to 
help predict firms’ future cash flows.) Moreover, if the stock market does not fully account for 
such implications for future cash flows, then inflation gains can be associated with future stock 
‘returns. Thus, by focusing attention on possible longer-horizon inflation effects, I shed new light 
on the extent to which the stock market incorporates inflation information, and on the pricing 


1 [n the text I refer to inflation because deflation has rarely occurred in the U.S.; however, the study pertains to both 
inflation and deflation because deflation can be viewed as negative inflation. 

2 This research mainly investigates questions related to the adoption of the Securities and Exchange Commission (SEC)'s 
Accounting Series Release (ASR) 190 in 1976 and Statement of Financial Accounting Standard (SFAS) 33 in 1979, 
which are no longer effective, that were mandated in response to high inflation during the period. Another related area 
follows Modigliani and Cohn (1979), who hypothesize that the stock market is inefficient because it suffers from 
inflation illusion (e.g., Ritter and Warr 2002; Campbell and Vuolteenaho 2004; Basu et al. 2006; Chordia and Shivaku- 
mar 2005). Another area attempts to explain the association between stock returns and unexpected inflation (e.g., 
Summers 1981; Bernard 1986). Whereas prior studies in this area examine how raw returns vary with respect to 
unexpected inflation, I investigate how investors process inflation-adjusted information and whether ex ante portfolios 
can be formed to generate future abnormal returns. I also extend prior work by developing a measure that captures Ше. 
total information content of inflation that is omitted when reporting nominal amounts (e.g., purchasing dates for 
different asset classes, as explained in the Appendix), and in contrast to prior studies, which analyze the effects of 
unexpected inflation on realized returns, I analyze the effects of expected inflation on expected returns. Other related 
areas investigate inflation effects in a variety of contexts, including the bond market, non-U.S. countries with high 
inflation, employment determination, dividend decision, and inflation tax (e.g., Phelps 1967; Fama and Schwert 1977, 
Bar-Yosef and Lev 1983; Gordon 2001; Kothari and Shanken 2004; Davis-Friday and Gordon 2005). 

3 [use the terms "inflation gains" and “unrecognized inflation gains” interchangeably throughout the study. 
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implications of inflation-adjusted data. I also extend prior research by providing insight into how 
investors process inflation-adjusted information, thereby providing evidence on the mechanism 
tbrough which inflation impacts future investment decisions. 

The first question I address is whether inflation gains can help predict tirms' future cash flows. 
The inflation effects can translate to future cash flows because higher unrecognized inflation gains 
accumulated in nonmonetary assets should result in higher cash flows from operations when the 
assets are used (in the case of property, plant, and equipment [PPE]) or sold (in the case of 
inventory) through many types of business activities, leading to a: positive association between 
unrecognized inflation gains and future cash flows. 

I address this question using two tests. In the first test, I investigate how inflation distorts 
nominal amounts. To do so, I develop an algorithm that extracts inflation-adjusted data from firms’ 
nominal financial statements. More specifically, the algorithm adjusts nominal financial statements 
for inflation, firm-by-firm, using the distinction between monetary and nonmonetary amounts, the 
life cycles of assets and liabilities, and the clean surplus relation. An important feature of the 
algorithm is that it allows inflation effects to vary across firms and over time, capturing the fact 
that inflation affects firms differently depending on the structure of their assets and liabilities. 
Further, the algorithm and the inflation-adjusted amounts in this study rely on the historical cost 
measurement attribute, which has reliability and objectivity advantages. This is the same system of 
accounting that underlies the current nominal reporting regime, except that amounts are stated in 
common, units. This test evidences considerable cross-sectional and time-series variation in infla- 
tion effects. To verify the external validity of the algorithm, I implement it on a sample of Israeli 
firms for which I hand-collect inflation-adjusted and nominal data—Israeli firms were required to 
disclose both sets of figures until 2003. This verification analysis (see the Appendix) shows that 
the algorithm provides reasonable estimates of inflation-adjusted financial statement amounts. 

In the second test, I examine whether unrecognized inflation gains are realized over time, and 
thus can help to predict future performance. In particular, I test whether unrecognized inflation 
gains improve forecasts of cash flows from operations for horizons from one to four years relative 
to a benchmark model developed in Barth et al. (2001). I find that unrecognized inflation gains are 
associated with future cash flows from operating activities in each of the four subsequent years. 

Next, I focus on the stock market effects of inflation. If inflation-adjusted information has 
implications for future cash flows, then investors are likely to incorporate it into their investment 
decisions. However, using inflatton-adjusted data can be costly because inflation-adjusted data are 
not reported and processing such data is more complicated than processing nominal data (Beaver 
and Landsman 1983). Because mispricing can arise when information is costly to obtain and 
process (Grossman and Stiglitz 1980; Ball 1994), even under the semi-strong form of market 
efficiency, this raises the question of whether investors fully incorporate the implications of infla- 
tion information for future cash flows in their equity-pricing decisions. I therefore investigate 
whether unrecognized inflation gains can be used to generate significant abnormal returns. 

Specifically, given my findings that inflation gains have predictive power for future cash 
flows, future returns are likely to depend on stock market expectations regarding these gains. If the 
market fully incorporates information about inflation gains, then stock prices will correctly reflect 
the implications of such effects for future cash flows, leading to no future abnormal returns. 
Alternatively, if the market does not fully incorporate information about inflation gains, then 
Stocks may be mispriced, leading to possible future abnormal returns. Accordingly, I predict that 
if investors overestimate (underestimate) the amount of unrecognized inflation gains in the current 
period, investors will be negatively (positively) surprised when these effects are realized in cash 
flows, leading to negative (positive) subsequent abnormal returns. 

To test this prediction, I first conduct a portfolio-level return test. This test examines returns 
for portfolios constructed based on inflation-adjusted information, controlling for common risk 
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factors. More specifically, this test uses portfolios constructed based on inflation gains and inves- 
tigates the intercepts (і.е., alphas or abnormal returns) obtained from time-series regressions of 
future returns in excess of the risk-free rate on the related-period Fama-French factor returns. This 
test reveals significant intercepts for the high and low inflation-based portfolios, with a zero-cost. 
hedge strategy that results in a significant monthly abnormal return of 77-85 basis points, con- 
trolling for the Fama-French, momentum, and net operating assets factors. Thus, it appears that 
stock prices do not fully refiect the implications of inflation gains. I also find a negative associa- 
tion between inflation gains and future abnormal returns. This finding indicates that inflation gains 
are underestimated (overestimated) when current-period inflation gains are low (high), leading to 
the significantly positive (negative) abnormal future returns as the inflation gains turn into future 
cash flows. As a check, I also conduct a firm-level test that analyzes the extent to which an 
inflation-based trading strategy predicts firms’ future annual abnormal returns. The firm-level test 
generates abnormal returns in a manner consistent with the portfolio-level test. 

The findings show that, although investors take inflation gains into account, they do not do so 
correctly. In additional analysis, I conduct two tests to examine whether market pricing errors can 
be explained by how the stock market processes inflation information. I find that the documented 
ability to generate future abnormal returns and the negative association between inflation gains 
and future abnormal returns is consistent with an inflation adjustment argument. Specifically, I find 
that the presence of negative or positive future abnormal returns is consistent with investors not 
distinguishing monetary and nonmonetary assets when adjusting for inflation, leading to a loss of 
information because inflation affects these two classes of assets differently. This finding suggests 
that stock prices act as if investors “fixate” on aggregate assets when taking inflation into account, 
which leads to errors when adjusting for the effects of inflation. 

Robustness tests show that there is no pattern in risk characteristics across inflation gains 
portfolios, and that an inflation-based factor is not a priced risk factor. These findings indicate that 
the abnormal returns I document are not attributable to an omitted inflation-based risk factor. 
Instead, these findings are consistent with abnormal returns stemming from inflation information 
being costly to obtain and process and, hence, are consistent with market efficiency under costly 
information (Beaver 1981). 

To summarize, this study is the first to find that (1) unrecognized inflation gains are informa- 
tive for predicting future cash flows, (2) inflation-based trading strategies are associated with 
significant abnormal returns, meaning that stock prices do not fully reflect the implications of 
inflation gains for future cash flows, and (3) stock prices act as if investors do not fully distinguish 
monetary and nonmonetary assets, which is necessary to determine the effects of inflation. To- 
gether, these results suggest that, contrary to prior work that concludes inflation-adjusted data are 
of little consequence to financial decisions, unrecognized inflation effects have significant eco- 
nomic implications, even during a period in which inflation is relatively low. 

Section II discusses related background and develops the study's main tests on cash flows and 
stock returns. Section Ш presents the research design. Section IV discusses data and sample 
statistics. Section V provides the main results. Section VI investigates how investors process 
inflation information. Section VII presents robustness tests. Section VIII concludes. 





* Although 1 find that inflation-adjusted amounts are informative for predicting future cash flows, I do not propose a shift 
to an inflation-adjusted reporting regime. Mandating an inflation-adjusted reporting regime in the U.S. may impose 
public- and firm-level costs that do not necessarily outweigh the benefits, especially when inflation is low. A normative 
evaluation of the trade-off associated with such a regime shift is beyond the scope of my study. х 
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IL. BACKGROUND AND MAIN TESTS 

Inflation is a macroeconomic phenomenon that captures the decrease in purchasing power of 
a currency unit over time because of a general increase in the prices of goods and services (Beaver 
and Landsman 1983). The fundamental accounting assumption of a Monetary Unit assumes a 
stable currency (ie., constant purchasing power) as the unit of record (SFAC 5, FASB 1984). 
Under U.S. GAAP, if the only activities of a firm are, for example, purchasing a parcel of land for 
$100 fifty years ago and purchasing an additional parcel of land for $100 one year ago, the firm 
recognizes land at $200 in its financial statements. Assuming that the purchasing power changes 
over time, this results in a loss of information because the parcels were purchased with the same 
dollar amount, but at points in time with different purchasing power. Thus, although comparability 
is one of the qualitative characteristics of accounting information (SFAC 1, FASB 1978), mixing 
dollars from different periods distorts the Monetary Unit assumption and impairs comparability 
across firms and over time. This raises the question as to the economic effects of such inflation 
effects. 


Implications of Inflation Information for Future Cash Flows 


When a firm engages in a transaction, the opportunity cost of the transaction from the per- 
spective of a representative investor is the fixed number of general consumption units the firm 
gives up. However, financial statements, which present nominal amounts, do not include informa- 
tion about the date of each transaction, and items are recognized in terms of dollars and not in 
terms of consumption units. Therefore, inflation creates a wedge between a recognized nominal 
amount and its cost in terms of consumption units. For a firm, nominal amounts mix dollars from 
periods with different dollar-consumption ratios, which impairs comparability over time. Across 
firms, nominal reporting further impairs comparability because there is large variation among 
firms' transaction dates and amounts when purchasing power is not constant. 

To extract information about transactions in terms of consumption units, financial statement 
items can be separated into two classes: monetary and nonmonetary. Monetary items are directly 
measured on the basis of a fixed number of dollars required for their settlement. Nonmonetary 
items represent either a historical cost or a right (obligation) to receive (deliver) services for which 
purchasing power is not constant. Under 0.5. GAAP, financial statements do not take into account 
gains and losses on monetary items (e.g., debt) attributable to inflation. For example, although the 
erosion of monetary assets (liabilities) is a loss (gain) to the firm, such losses and gains are not 
recognized in U.S. GAAP-based financial statements. Financial statements also do not take into 
account the effects of inflation on nonmonetary items (e.g., PPE). For example, whereas inflation- 
adjusted amounts change over time because of inflation, the amounts recognized for gross PPE do 
not reflect such changes. The difference between earnings based on the two measures, 
IGL = IAEarnings — NominalEarnings, captures unrecognized inflation gains and losses. The 
variation in inflation-adjusted earnings, and hence іп IGL, is a result of the level of and changes in 
inflation, and the difference across firms and over time in the structure of monetary and nonmon- 
etary items. Thus, firms with similar nominal outcomes can differ in their inflation-adjusted out- 


5 An example inspired from The Economist's "Big Mac index" (http://www.economist.com/markets/bigmac/about.cfm) 
illustrates how inflation can lead nominal accounting amounts to not reflect a transaction's opportunity cost in terms of 
consumption units. Suppose S100 spent on a parcel of land 50 years ago could purchase 100 Big Macs 50 years ago, but 
because of inflation, $100 can only purchase 40 Big Macs today. Because the $100 spent 50 years ago is reported as 
$100 in the current-period's nominal financial statements, the accounting system equates the original amount spent by 
the firm to the amount required to purchase 40 Big Macs today. That is, the accounting system effectively informs 
investors that the firm gave up an amount corresponding to 40 Big Macs to purchase the parcel of land. Yet the fixm in 
fact purchased the land for the equivalent of 100 Big Macs. In inflation-adjusted terms, the $100 spent 50 years ago is 
economically equivalent to $250 (=100*100/40) in today's dollars. 
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comes, and JGL can vary for a firm over time even when the nominal amounts remain constant. 

Inflation gains can turn into future cash flows in several ways, depending on firms' activities. 
Because the underlying economics of firms' activities are the same regardless of how these ac- 
tivities are reported in financial statements, if the nominal financial statements do not fully capture 
the effects of inflation today, such inflation effects are likely to be realized in future periods, 
thereby enabling the inflation effects to help predict future performance. The inflation effects can 
turn into future cash flows from operations (CFO) because higher unrecognized inflation gains 
accumulated in nonmonetary assets should result in higher future CFO when the assets are used 
(in the case of PPE) or sold (in the case of inventory). Further, because inflation is correlated with 
changes in specific prices, predicting higher future CFO from increases in the general price index 
is consistent with prior literature that shows increases in specific prices result in higher CFO (e.g., 
Aboody et al. 1999). 

There are many types of business activities for which inflation gains can turn into future cash 
flows. Consider, for example, the case of PPE. Suppose a parcel of land acquired for $100 by the 
firm 50 years ago and, based on the purchasing power at that acquisition time, the firm gave up 
100 Big Macs (i.e., one Big Mac—consumption unit—costs $1). Suppose also that at the acqui- 
sition time, the firm expects to generate 10 percent inflation-adjusted annual yield by renting the 
land to another firm, i.e., it expects cash yield that 18 equivalent to ten Big Macs per year. Under 
а nominal reporting regime, firms' transactions are not linked to consumption units. Thus, when 
the purchasing power decreases over time, a particular number of consumption units in the past is 
equivalent to more cash in the present. Accordingly, in future periods, when the purchasing power 
of the dollar decreases because of inflation, the rent income is expected to increase, leading to 
higher CFO. Also, suppose a firm decides to use its land parcel for its own benefit rather than 
renting it to another firm. As the purcbasing power of the monetary unit decreases over time, the 
use of the land parcel, which is nonmonetary, allows the firm to avoid paying an increasing 
amount to rent the parcel from another firm. This allows the firm to avoid cash payments that are 
increasing over time. Another example is inventory. Inventory is often sold several months or 
years after it is purchased, depending on its turnover, which is a function of the firm's operations 
and business cycle. The inventory accumulates inflationary gains that result in higher CFO upon 
the inventory sale. Thus, overall, I expect that inflation gains on PPE and inventory translate to 
higher future CFO. 

If inflation gains, ІСТ, can turn into future CFO, then I predict a positive association between 
IGL and future CFO. I also expect inflation gains to turn into future cash flows over several years. 
First, inflation gains can be accumulated in nonmonetary assets (e.g., land, plant, and buildings) 
that usually have life cycles of several years. For example, unrealized inflation gains accumulated 
in a real estate property that is rented out can translate to CFO when the firm renews the lease or 
has another lessee beyond the one-year horizon. Second, inventory inflation gains can last longer 
than one year under LIFO, which many U.S. firms use, and when inventory turnover is longer than 
one year. 

Note that inflation-adjusted accounting amounts are not fair value amounts. In contrast to fair 
value amounts, management discretion and subjectivity do not play a role when adjusting for 


~ 


Inflation can lead to large differences between nominal and inflation-adjusted earnings even when inflation is low. For 
example, during a period of 6 percent average annual inflation in Israel, the median absolute difference between the two 
earnings amounts is 38 percent of nominal earnings. Also, during a period of 3 percent average annual inflation in the 
U.S., using the inflation-adjusted earnings for the U.S. sample, the equivalent ratio is 29 percent. 
7 For example, untabulated results suggest that monthly inflation rates are highly and significantly correlated with monthly 
changes in major indices of nonmonetary assets (commodity and housing) over the past 59 years of available data in the 
Global Insight database, with Spearman and Pearson correlations of 0.7 on average. 
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inflation. When adjusting for inflation, I rely on the same measurement attribute underlying finan- 
cial statements (mainly modified historical cost) and, thus, the adjustment procedure is objective. 
This is the same system of accounting as under the current nominal reporting regime, except that 
amounts are stated based on units of equal meaning. In particular, I adjust nominal amounts using 
a general price index to obtain a common monetary unit. The resulting (inflation-adjusted) 
amounts capture the implications of inflation for a representative investor who is interested in 
maintaining consumption units and is thus exposed to an average basket of goods and services.® 


The Extent to which Stock Returns Reflect Inflation Information 


Models in asset pricing often assume that an asset’s price depends on the comovement of its 
payoff with marginal utility from consumption over time and across states of nature. Nominal 
reporting affects investors’ assessment of firms’ activities in terms of consumption units. If infla- 
tion gains, which capture firms’ activities in terms of consumption units, are informative for 
predicting future cash flows, then this raises the question of whether the stock market fully 
incorporates such information when valuing firms’ equity. Investors are likely to use information 
if it has implications for future cash flows. However, the use of inflation-adjusted data is more 
complicated than the use of nominal data (е,2., SFAC 5, FASB 1984). Furthermore, because 
classes of assets and liabilities are affected differently by inflation, and the inflation impact varies 
over time based on the structure and age of a firm’s assets and liabilities, obtaining inflation- 
adjusted information (which is publicly unavailable) requires acquisition and processing costs that 
do not necessarily outweigh their benefits. These factors suggest that inflation accounting may 
reflect market efficiency under costly information (Grossman and Stiglitz 1980; Beaver 1981; 
Ball 1994). 

If unrecognized inflation gains turn into future cash flows, then future returns should depend 
on how investors process information about these effects. Thus, I expect to find no subsequent 
abnormal returns if investors fully incorporate inflation information into their investment deci- 
sions, Alternatively, if investors do not fully incorporate information about inflation gains, stocks 
may be mispriced, leading to possible abnormal returns. Specifically, if investors overestimate 
(underestimate) inflation gains in the current period, then investors will be negatively (positively) 
surprised in the future when these gains turn into cash flows, leading to negative (positive) 
subsequent abnormal returns. 


Ш. RESEARCH DESIGN 
Inflation Adjustment Algorithm 


To obtain accounting amounts that capture inflation effects, I develop an algorithm that 
converts nominal amounts into inflation-adjusted amounts on a firm-by-firm basis for a broad 
sample of firms.” The algorithm uses a constant dollar approach to measure firms’ activities as if 
purchasing power were constant. Further, it uses the same measurement attribute underlying 
nominal financial statements, restating the nominal amounts using an objective general price 
index. 


Another reason that I adjust for inflation using a general price index, rather than specific indices for particular categories 
of assets is to be consistent with inflationary GAAP, which uses the general price index. Further, adjusting using specific 
assets requires not-readily available data regarding the composition of all assets within each asset class and the asso- 
ciated specific index that captures the price increase in the specific asset. Also, adjusting using specific industry indices 
is likely to introduce significant measurement error because a firm’s composition of assets varies widely even for firms 
within the same industry. In fact, adjusting based on the general price index is likely to bias against finding significant 
abnormal returns and CFO predictability from using inflation gains. 

Prior studies usually focus on particular countries or short periods because of data limitations. By using an algorithm to 
adjust nominal amounts for inflation I am able to analyze inflation effects on a broader sample. 
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The algorithm is described in the Appendix. In brief, the algorithm extracts inflation-adjusted 
information from GAAP financial statements in three main steps. First, it reconstructs, in inflation- 
adjusted terms, each firm's financial statements over the firm's life until the reporting date of 
interest. Because the distinction between monetary and nonmonetary items is key to distinguishing 
the effects of inflation on different accounting amounts, the algorithm separates monetary from 
nonmonetary assets and liabilities. Second, because monetary items are measured in financial 
statements based on the fixed number of dollars required for their settlement, the algorithm uses 
the monetary amounts from the financial statements as the inflation-adjusted amounts. In contrast, 
the algorithm adjusts nonmonetary items by estimating purchase dates and using information on 
asset life cycles and inventory turnover. Third, the algorithm estimates the wealth generated each 
period, in inflation-adjusted terms, by differencing two successive sets of assets and liabilities and 
using the clean surplus relation to incorporate other comprehensive income amounts, stock issues, 
and other events that can be affected by inflation over the period (е.р., dividends). '? 


Implications of Inflation Information for Future Cash Flows 

If inflation-adjusted earnings are higher (lower) than nominal earnings in the current period, 
the unrecognized inflation gains (losses) should result in increased (decreased) future CFO. To test 
for the ability of inflation gains to predict future CFO, I extend the framework developed in Barth 
et al. (2001) to include inflation gains. Specifically, for horizons of one to four years, i.e., for 
т = 1 through 4, I estimate the following equation: 


CFO,,,= 0+ ВІСІ, + у: CFO, y: AAR, + 4: ММУ, + y4: AAP, + у; + DEPN, 

+ Ye: OTHER, + Hirr | (1) 
where ІСІ, denotes inflation gains and losses for year t; CFO, INV, AR, INV, AP, DEPN, and 
OTHER are cash flows from operations, accounts receivable, inventory, accounts payable, depre- 
ciation and amortization, and other accruals, respectively; and A denotes annual change. 

In Equation (1), the coefficient on inflation gains, 8, is the coefficient of interest. I predict a 
positive association between inflation gains and future CFO (В > 0). I also predict B to be positive 
for more than one year because assets can have life cycles longer than one уеаг.!! Following Barth 
et al. (2001), I predict the signs on у, У», Ys, Ys, and ys to be positive and the sign on уд to be 
negative. 

By construction, the sum of CFO and disaggregated accruals equals nominal earnings, and 
thus the equation includes time ¢ earnings. This allows me to test the incremental predictive effect 
of IGL, beyond time t nominal earnings. I estimate Equation (1) using a pooled regression that 
includes observations cross-sectionally and over time, as well as using annual cross-sectional 
regressions. For the pooled regression, 1 base test statistics on regression residuals clustered by 
firm and year (Petersen 2009; Gow et al. 2010). I include industry fixed effects in the annual 
cross-sectional regressions, and I report the mean coefficients of time-series parameter estimates 


19 Also, the adjustment procedure is consistent with inflationary GAAP that is currently active or was active in the past, 
e.g., International Financial Reporting Standards (IFRS): International Accounting Standards (IAS) 15 and 29, Interna- 
tional Financial Reporting Interpretations Committee (IFRIC) 7; Israeli GAAP: Institute of Certified Public Accountants 
in Israel (ICPA) Statements 36 and 50; and U.S. GAAP: SFAS 33 and ASR 190. 

My motivation stems from whether there is a positive association between current-period inflation gains and future cash 
flows, thus I focus on the direction of the coefficient on inflation gains, rather than its magnitude. At the extreme there 
could be a one-to-one relation between inflation gains and future cash flows. Yet, predicting the magnitude of this ` 
coefficient is outside the scope of my study, and it depends on an array of factors and parameters that could vary widely ' 
depending on the assumptions regarding these parameters and for which data are unavailable (e.g., the life cycles of 
nonmonetary assets, the relation between inflation rate and tbe increase in specific asset prices and their benefits, and the 
composition of nonmonetary assets). 
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obtained from the cross-sectional regressions. I conduct tests on these cross-sectional estimates 
using the Fama and MacBeth (1973) t-statistic and two Z-statistics. I primarily rely on the robust 
double-clustered pooled specification because Fama-MacBeth t-statistic and the Z-statistics can be 
overstated if cross-sectional and time-series correlations exist in the data. 


The Extent to which Stock Returns Reflect Inflation Information 


Portfolio-Level Approach 

If mispricing exists, then a relation between inflation gains and subsequent returns is likely to 
be evident in portfolio-level returns. To test this prediction, the design focuses on the intercepts 
from portfolios constructed based on inflation gains. The estimated intercepts permit testing the 
ability of inflation information to explain systematic differences in the cross-section of stock 
returns, controlling for common risk factors (Fama and French 1993). In particular, 1 test whether 
the intercept for the low ІСІ, portfolio is significantly different from the intercept for the high 
portfolio. To implement the test, I construct ten portfolios such that each period all firm-year 
observations with low (high) JGL are sorted into portfolio one (ten). I then calculate future 
monthly returns for each portfolio and estimate the following time-series equation at the portfolio 
level to obtain the portfolio intercepts: 


Кри Кет = др + Врмктке' МКТКЕ а + В, sup: SMBa В, uui: HML, + £55, (2) 


where К „т is the portfolio return for portfolio p in month т; Rf is the one-month Treasury bill 
rate; and МКТКЕ,, SMB,,, and HML,, are the Fama-French factor returns, where MKTRF is the 
excess return on the market, and $MB and HML are constructed based on MVE and BTM, respec- 
tively. Because inflation gains are estimated annually, I align firms’ IGL on monthly returns 
accumulated over the 12 months beginning three months after the fiscal year-end, to allow for 
dissemination of annual reports information and the associated annual inflation rate. 

To test for hedge abnormal return from using an inflation-based trading strategy, I construct a 
zero-cost investment portfolio that longs the lowest portfolio (portfolio one) and shorts the highest 
portfolio (portfolio ten). I then regress this zero-cost portfolio’s returns on the related-period factor 
returns. The intercept from this zero-cost hedge regression can be interpreted as a monthly abnor- 
mal return on a zero inflation-based hedge strategy that buys portfolio one and sells short portfolio 
ten. 

I also sequentially add as controls the Carhart (1997) momentum factor (UMD) and a net 
operating assets factor (FNOA) following Hirshleifer et al. (2004). Hirshleifer et al. (2004) show 
that the ratio of net operating assets to lagged total assets, which they refer to as balance sheet 
bloat, is associated with future returns. I add this effect in all return tests to control for the 
possibility that a relation between Net Operating Assets (ОА) and inflation gains can affect the 
association between inflation gains and future returns. To do this, I first obtain NOA following 
Hirshleifer et al. (2004), and then form а NOA-based factor following-the procedure described in 


12 Two points regarding this test. First, I focus on the inflation effects on future cash flows because cash flows receive 
considerable attention by the investment community and аге of major importance in valuation of firms (e.g. 
Hackel et al. 2000; Barth et al. 2001). Cash flows are also more difficult to manipulate compared to earnings. However, 
inflation can also affect nominal earnings because inflation gains can translate into future nominal earnings in several 
ways (e.g., inventory holding gains). Second, with respect to the Z-statistics, both Z-statistics control for cross-sectional 
correlations but Z2 also partially corrects for potential upward bias in 21 arising from lack of independence of param- 
eters across the regression groups (Barth 1994). Zi=[(1/ JN) ХАУ Ок (ку - 23), where t; is the t-statistic for 
cross-sectional regression j, Ку is the degrees of freedom, and М is the number. of cross-sectional groups; and 72 
= mean(t) / [stddev(r) / /(№-1)], where тлеап (7) and stddev(?) respectively refer to the mean and standard deviation 
across the group estimates. 
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Fama and French (1993) in forming the HML апі SMB factors. The FNOA factor is а factor- 
mimicking NOA portfolio.’ 

To the extent that inflation gains result in higher future cash flows, if investors correctly 
estimate inflation gains, then I expect the future abnormal return, œp, to be insignificantly different 
from zero. Alternatively, if investors underestimate (overestimate) inflation gains, then I expect a, 
to be significantly positive (negative), indicating positive (negative) subsequent abnormal returns 
as these inflation gains turn into higher- (lower-) than-expected future cash flows. 

As depicted in Figure 1, I adjust for inflation year / amounts using the nominal amounts and 
inflation time-series rates until year-end г. Thus, inflation gains are known at year-end f and 
investors have this information before 1+1 abnormal returns begin to accumulate. The association 
between current-period inflation gains and subsequent abnormal returns therefore depends on how 
the expected and unexpected components of inflation gains are estimated. Analogous to previous 
research, the source of the surprise that drives future returns, or the unexpected component of 
inflation gains at time f (and therefore the unexpected cash flows at 1+1), is the difference between 
inflation gains that are estimated correctly versus inflation gains that are estimated without distin- 
guishing monetary and nonmonetary assets. 

In other words, similar to the earnings surprise literature, the source of this study's surprise is 
the difference between the inflation adjustment that I estimate using the algorithm based on 
inflationary GAAP versus an adjustment that ignores the monetary-nonmonetary distinction.’ 
This type of analysis is analogous to that in Sloan (1996), who examines whether investors 
adequately distinguish the different persistence of the cash flows and accruals components of 
earnings in predicting future earnings." Also, my use of future abnormal returns to infer the 


13 Following Hirshleifer et al. (2004), I obtain МОА as NOA = RawNOA / TotalAssets,.,, where RawNOA = Operating 
Assets -- Operating Liabilities; Operating Assets = Total Assets (Compustat: AT) — Cash and Short-Term Investment 
(Compustat: CHE); and Operating Liabilities = Total Assets (Compustat: AT) — Debt Included in Current Liabilities 
(Compustat: DLC) — Long-Term Debt (Compustat: DLTT) ~ Minority Interests (Compustat: MIB) -- Preferred Stocks 
(Compustat: PSTK) - Common Equity (Compustat: CEQ). Next, I form a NOA-based factor following Fama and 
French (1993). At the end of each month, I sort all observations into two NOA groups, where group one (two) includes 
observations with low (high) NOA, and three book-to-market (BTM) groups, where group one (three) includes obser- 
vations with low (high) ВТМ. I then construct six portfolios (L/L, L/M, L/H, H/L, Н/М, H/H) from the intersections of 
the two NOA and three BTM groups, where the first letter in each of the X/X combinations refers to the NOA portfolio 
(Low, High) and the second letter refers to the BTM portfolio (Low, Medium, High). I then calculate monthly value- 
weighted returns on the six portfolios over the subsequent year, beginning three months after the fiscal year-end. ЕМОА 
is calculated each month as the average of the monthly returns on the three high NOA portfolios (H/L, H/M, and H/H) 
minus the average of the monthly returns on the three low NOA portfolios (L/L, L/M, and L/H). 

Therefore, what leads to future abnormal returns is whether investors understand the differential effect of inflation on 
monetary versus nonmonetary assets, rather than investor understanding of expected versus unexpected inflation. This is 
different from investors’ failure in the current period to distinguish expected and unexpected inflation. Specifically, when 
I estimate inflation-adjusted amounts, actual inflation is known because it is realized, as shown in Figure 1. Whether 
actual inflation is fully anticipated or fully unanticipated does not affect my predictions because investors use actual 
inflation, rather than its expected or unexpected component, to derive inflation gains. Further, the distinction between 
anticipated and unanticipated inflation is important when examining how current-period earnings changes explain 
contemporaneous stock price changes, under the notion that stock prices respond to the unanticipated inflation during 
the year—a setting that was widely used in the research design of inflationary accounting studies during the 1970s- 
1980s period. In contrast to this contemporaneous setting, my motivation and design are forward-looking, that is, in my 
study the events flow such that current-period (year г) inflation gains are estimated first, and only in the subsequent 
period (141) do these gains turn into cash flows. The subsequent returns thus do not arise from unanticipated inflation 
that affects year t inflation gains (see Figure 1). 

To the extent that inflation gains are perfectly correlated over time, there may be no surprise component when these 
inflation effects turn into cash flows over time and thus there may be no theoretical link between inflation gains and 
future returns. However, when I calculate serial correlations in [GL over periods t and (4-1, I find Pearson and Spearman 
correlations of 0.176 and 0.294, respectively. These correlations indicate that inflation gains show only weak persistence 
over time, as they have low serial correlation. Thus, there is no systematic relation among inflation gains over time. 
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FIGURE 1 
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The figure plots the timeline for analyses throughout the study, and the link between current inflation gains and 
subsequent abnormal returns. Inflation gains and losses (IGL) are estimated after the fiscal year-end, using 
information from nominal financial statements and actual inflation rates as of the end of the year. IGL is defined 
as the difference between inflation-adjusted earnings and nominal earnings (scaled). Subsequent abnormal re- 
turns are accumulated over the year beginning three months after fiscal year-end. Because inflation gains affect 
the realization of subsequent cash flows, the accuracy of estimating inflation gains affects the accuracy of ex- 
pectations regarding future cash flows. Subsequent returns are affected depending on actual versus expected 
inflation gains. 


expected versus unexpected components is consistent with prior studies (e.g., Bernard and Thomas 
1990; Sloan 1996). For example, similar to Sloan (1996), who infers the expected and unexpected 
persistence of accruals versus cash flows by examining future abnormal returns, I infer the ex- 
pected and unexpected components of inflation gains (which lead to a future surprise when these 
gains affect future cash flows) by examining the patterns in future abnormal returns and inflation 
gains estimated with and without making the monetary versus nonmonetary distinction. 


Firm-Level Approach 

As a check on the portfolio-level analysis, I also conduct a firm-level test to discern whether 
it is possible to earn abnormal returns from inflation-adjusted information. In particular, each year, 
I form ten portfolios such that firms with the lowest (highest) JGL are sorted into portfolio one 
(ten). Then, for each portfolio-vear, I calculate mean abnormal returns, accumulated over the 
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subsequent year. Ав before, I aligu firms' annual amounts to monthly returns in the next 12 months 
beginning three months after the fiscal year-end. 

I examine abnormal returns using four metrics. These are returns adjusted for the value- 
weighted return on the market (market-adjusted), for the Fama-French factors (Fama-French ad- 
justed), for the Fama-French and momentum factors (Fama-French-UMD), and for the Fama- 
French, momentum, and the net operating assets factors (Fama-French-UMD-FNOA adjusted). To 
obtain abnormal returns, Í first calculate raw returns by annually compounding each firm's 
monthly returns. The market-adjusted return is calculated as the annually compounded raw return 
minus the annually compounded value-weighted return on all NYSE, AMEX, and NASDAQ 
stocks in CRSP. Next, I estimate the following time-series equation for each firm: 


Rim - Rem = Kit В; мктаРМ КТЕЕ,, + В,ѕмв5МВ,, + В; нм ЯМІ + Sim- (3) 


I also estimate an equation similar to Equation (3) after sequentially adding as controls 
Carhart’s (1997) momentum (UMD) and the above-defined balance sheet bloat (FNOA) factors. 
Estimation of Equation (3), and its augmented version with UMD and FNOA, over the sample 
period yields firm-specific betas, Bi uxrrre Bi SMB Вінмі» Bump» and В ғол» which І winsorize at 
the top and bottom one percent. Finally, I obtain abnormal returns by subtracting from raw returns 
the product of a firm’s betas and the respective factor returns, compounded annually. 617 


| IV. DATA AND SAMPLE STATISTICS 

The sample covers U.S. firms with fiscal year-ends over the period 1984 to 2008, a period 
during which inflation was relatively low (average 3 percent). I obtain nominal accounting vari- 
ables from the Compustat North America Fundamental Annual database, XPF Format. I obtain 
monthly raw stock returns from the CRSP Monthly Stock File, and I adjust these for delisting 
returns following Shumway and Warther (1999) and Beaver et al. (2007). 8 Т obtain the risk-free 
rate, the Fama-French and momentum factors, and portfolio returns from the Fama-French Port- 
folios and Factors dataset available through the Wharton Research Data Services (WRDS), and the 
consumer price indices data used in the inflation adjustment procedure from the Global Insight 
(DRI) dataset's Basic Economics Monthly Series (PRNEW). 

To avoid extreme values obtained from deflating by a small denominator or from using a 
negative book value of equity, I delete observations with total assets, total revenues, or MVE lower 
than $10 million, and observations with negative book value of equity. To align forecasting tests, 
and to verify that annual amounts are for a 12-month period, I delete firm-year observations if the 
firm's fiscal year changed during the year. To mitigate the effects of using penny stocks, I delete 
stocks with stock price lower than one dollar. The accounting variables are deflated by market 


16 The firm-level test may lack power due to measurement errors in firm-specific factor betas and IGL. The portfolio test, 
in contrast, allows me to analyze the variation in the cross-section of expected returns. That is, rather than using 
firm-specific intercepts that depend on unknown firm-level characteristics, the portfolio test conditions on a pre- 
determined characteristic—inflation gains—and then identifies whether the inflation mispricing effects not explained by 
the factors vary with this characteristic. Thus, the portfolio test is less subject to this concem. 

The return analyses involve the interaction of returns, which are nominal, with inflation-adjusted amounts. For compa- 
rability with inflation-adjusted amounts, the inflation effects are largely purged from the return metrics both by sub- 
tracting the risk-free interest rate from raw returns and by analyzing the incremental effect beyond that contained in the 
market factor. To check the extent to which the risk-free interest rate is associated with inflation, I compute correlations 
between the monthly time-series amounts of inflation and risk-free rates. The Spearman and Pearson correlations both 
are 0.42 and significant, consistent with the risk-free rate absorbing a large portion of the inflation effects from the return 
metrics. Further, I design the analyses to be on a cross-sectional basis, estimated either monthly or annually. This is 
because in any given period inflation-adjusted returns are perfectly correlated with nominal returns because inflation is 
the same for all firms (Beaver and Landsman 1983). 

18 Inferences from all return tests are unchanged when I do not adjust for delisting returns. 


The Accounting Review May 2011 
American Accounting Association 


Inflation and Nominal Financial Reporting: Implications for Performance and Stock Prices 1057 


value of equity (MVE) at the beginning of the year, MVE, МЕН To mitigate the effects of outliers, 
I winsorize all variables in Table 1 at the top or bottom one percentile of the deflated value in each 
year. To reduce measurement error in deriving inflation gains, either from nominal earnings or 
from using the algorithm, I delete observations in the top or bottom one percentile each year of 
deflated nominal earnings, inflation-adjusted earnings, and inflation gains. I use the same industry 
classifications as in Barth et al. (2010). The final sample comprises 64,597 U.S. firm-year obser- 
vations. 

The main explanatory variable of interest in the analysis is IGL, ог JAEarnings minus Nomi- 
nalEarnings. NominalEarnings is Income before Extraordinary Items as reported in the financial 
statements, and JAEarnings is nominal earnings restated on an inflation-adjusted basis using the 
algorithm described in the Appendix. To obtain data necessary to estimate Equation (1), I proceed 
as follows. For observations with fiscal years ending after July 15, 1988, I obtain CFO from the 
statement of cash flows (Barth et al. 1999). Specifically, CFO is net cash flow from operations less 
the accrual portion of extraordinary items and discontinued operations reported on the statement of 
cash flows, and total operating accruals (ACCRUALS) are calculated as NominalEarnings minus 
CFO. The following are also calculated based on the statement of cash flows: change in accounts 
receivable (AAR), change in inventory (AINV), change in accounts payable and accrued liabilities 
(AAP), and depreciation and amortization (DEPN). If AAR, AINV, or AAP is missing, I calculate 
these items as I do for observations without statements of cash flows. The net of all other accruals, 
OTHER, is OTHER = NominalEarnings + DEPN — (CFO + AAR + AINV — AAP). For 
observations without a statement of cash flows available, that is, for observations with fiscal year 
ending before July 15, 1988, I derive CFO from accruals using information from the income 
statement and balance sheet (Dechow et al. 1995; Sloan 1996). To do so, AAR, AINV, and AAP are 
the change in the applicable balance sheet account (accounts receivable, inventory, and accounts 
payable plus accrued expenses, respectively), and DEPN is depreciation and amortization as 
reported in the income statement. ACCRUALS = AAR + AINV — AAP — DEPN + OTHER, 
where OTHER = [(ACA — ACASH) — ACL] ~ (AAR + AINV — AAP); ACA, ACASH, ACL are 
the respective changes in current assets, cash/cash equivalents, and current liabilities. Then, CFO 
= NominalEarnings — ACCRUALS. Additional variables I use include total assets (TotalAssets), 
net sales (Revenues), gross and net PPE (GrossPPE and NetPPE), and MVE, calculated as fiscal 
year close price multiplied by common shares outstanding. The Appendix includes details on the 
Israeli data I use for the validation analysis. 

Table 1 reports descriptive statistics for select variables. TotalAssets, Revenues, and MVE are 
presented in million U.S. dollars; Book-to-Market and TotalAssets / МУЕ, are given as ratios; 
Biuxtre Візмв» Pi ur, 884 B; up are the firms’ Fama-French and momentum betas, estimated 
based on Equation (3); and all other variables are deflated by МУЕ, |. Panel A reports statistics 
based on pooled firm-year observations. It shows that the mean (median) MVE,_, is $2.17 billion 
($217 million). TotalAsseis are usually larger than MVE, , with a mean (median) ratio of 2.23 
(1.30). The sign on the mean and median variables used in Equation (1) are generally consistent 
with prior research (e.g., Barth et al. 2001), with a positive sign on CFO, NominalEarnings, AAR, 
AINV, and DEPN, and a negative sign on AAP, ACCRUALS, and OTHER. Also, the means of 
NonMonAssets (which consists of NetPPE, INV, and Intangibles) and NetMonItems (monetary 
assets minus monetary liabilities), which are drivers in the inflation adjustment, respectively equal 


19 I use MVE as the deflator in the cash flows prediction model for two reasons. First, it is consistent with prior research. 
Second, my goal is to control for scale differences using a measure that is uncorrelated with my algorithm to avoid 
spurious results driven from the error in the undeflated models being correlated with the regressors. In the derivation of 
inflation-adjusted earnings, the algorithm uses several accounting amounts of the firm, so deflating by accounting items 
(e.g., total assets) may introduce spurious correlations with the accounting variables when predicting cash flows. 
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1.02 and —0.22, with respective standard deviations of 1.26 and 0.93, indicating a high dispersion 
in these measures and in /GL. As expected given the broad sample, the mean (median) market beta 
of 1.01 (0.99). The mean (median) of NominalEarnings is 0.05 (0.06), which is higher than the 
mean (median) of JAEarnings of 0.03 (0.04). The mean (median) ІСІ, is —0.02 (—0.01) with a 
standard deviation of 0.09." The standard deviations of IGL and IAEarnings reveal a large varia- 
tion in these measures. 

Table 1, Panel B reports means and standard deviations for each industry by year. With 
respect to major nonmonetary items, the panel shows that the utilities, extractive, transportation, 
food, services, mining, and construction industries are the most capital intensive, with NetPPE 
varying between 0.63 and 1.81 in these industries. Panel B also shows that monetary liabilities are 
larger than monetary assets in most industries, and that PPE(f), the expected remaining useful life 
(in months) for an average NetPPE item, is volatile, with the highest age of average assets in the 
food, textiles, printing, publishing, chemicals, durable, and retail industries (mean PPE(t) varies 
from 60.35 to 64.51 months in these industries). Even within industries, Panel B reveals high 
variation in monetary and nonmonetary items, which are drivers for inflation adjustments, sug- 
gesting high variation in inflation-adjusted earnings. 


V. MAIN RESULTS 
Implications of Inflation Information for Future Cash Flows 


Table 2 reports results from the cash flows analysis. It shows that the coefficients on aggre- 
gated accruals are consistent with Barth et al. (2001). It also reveals that В is significantly positive 
based on the t-statistics and Z-statistics in the pooled and cross-sectional specifications for all four 
of the forecasting horizons considered." In particular, based on the pooled and cross-sectional 
specifications, В is significantly different from 0 and equals 0.05 and 0.07 (0.05 and 0.08; 0.05 and 
0.11; 0.10 and 0.13), respectively, at the one- (two-, three-, four-) year horizon. This evidence 
suggests that inflation gains help predict future CFO. 


The Extent to which Stock Returns Reflect Inflation Information 


Table 3 reports results from the portfolio-level return analysis. The results show that five 
portfolio intercepts are significant, which indicates that forming portfolios based on inflation gains 
(G) generates significant abnormal returns. The intercept on the lowest portfolio is 0.0049 
(p « 0.001), whereas the intercept on the highest portfolio is —0.0036 (р = 0.002). The intercept 
of each portfolio can be interpreted as the monthly abnormal return from buying the specific 
portfolio and selling short a risk-free asset. Tbe table also reveals that, controlling for the Fama- 
French factors, the monthly zero-cost hedge return for a portfolio constructed on the difference 
between the intercepts for portfolios one and ten equals 0.00850 and is significant (p « 0.001). 
The table also reveals that this significant abnormal hedge return holds when controlling for 
momentum (UMD) and net operating assets (FNOA) factors in addition to the three Fama-French 
factors, with significant monthly zero-cost hedge returns of 0.00826 (p « 0.001) and 0.00767 
(p « 0.001), respectively. The findings indicate zero-cost abnormal hedge returns of 0.77 percent 
to 0.85 percent per month for using inflation information, controlling for common risk factors. 


20 As explained in the Appendix, 101, is normalized based on a reference point underlying the adjustment procedure. 
Because I adjust accounting amounts to be stated in terms of constant dollars to maintain the purchasing power at the 
estimated purchasing date of nonmonetary items, IGL is more frequently negative. Alternatively, IGL can be adjusted 
such that it is more frequently positive. The variation across firms and over time is unchanged under the two alternatives, 
and so are the cross-sectional results throughout the study. 

21 p-values of 5 percent or less are considered statistically significant, and all significance levels are one-tailed where I 
have predictions and two-tailed otherwise. 


The Accounting Review May 2011 
American Accounting Association 


1063 


Inflation and Nominal Financial Reporting: Implications for Performance and Stock Prices 


(a8pd тхәи uo panuguoo) 


100'0> 
е9 
1000> 
cg ST 
10070— 
LO'S 
8880 210 
100705» 
8979 
LOCO oro 




















мача 

















$85'0 1000» 1000» 
0€0— L9'L pL 
8220 100°0> 100'0> 
£L'0— LEIZ LEST 
5250 1000» 1000» 
єЄ0- ил, 8801 
100- 920 LTO 
01070 1000 1000» 
ЄЄТ- 988 6671 
900- 220 Sco 
THOAS 
62070 1000» 1000» 
Soz 069 LEST 
1000» 10007 1000» 
SYL 1667 09:58 
робо 100707 1000» 
STZ 101 18:11 
6070 ETO 620 
2510 1000» 10070 
01 £79 98791 
200- ГО Sco 
Odd 
- + + 
4УУ ААУ ХУУ 








042 

















90070 2660 
082 820- 
1000» Р8ЕО 
179 0£0— 
700`0 ПРО 
9076 STO- 
800 0070 
80070 100705» 
LLET 15:02 
2070 £0'0 
1000» 9520 
9vv L9'0 
100'0> 9810 
voL 260 
10005» 9810 
Ley 260 
1070 1070 
20070 1000» 
562 УГ 
2070 200 
+ é 
ЗЭ! 9 


БАӨГ ЧЕН amma 10) поцешыоуат подера) jo 58өрвэн Чи 
© A Ta vL 


попэтрәла 


[2101526-5$010) 


рә[оод 


18000296-580Ј7) 


popod 


May 2011 


American Accounting Association 


The Accounting Review 


Konchitchki 


1064 


{2804 ixou uo рэпицио2) 


























T0070 100707 89170 1000» 10070» 100705» 91070 3321 onyea-d 
9 6061 ‚ 660— 19 £€'6 PL bl 9Е@ 210- TZ 
100°0> 100`0> 18070 1000» 10070» 100702» 100705» 1570 enjeA-d 
6У/ 01 ПЕР 161- 1851 6512 [7222 799 110- IZ 
1000» 10070» €61'0 10070:» 10070» 1000» 80070 7870 enpeA-d 
©ТУ 9170 68:0- 189 ©Є`6 €9°ST 6972 +00 38053 WA 
010 ЕГО 660 70`0— 10 6€0 $50 єго 000 "1205 ues] Teuonosg-ssoT) 
1000» 10070» 9109 100'0:> 100°0> 100'0> 1000» 1000» эптел-4 
ил ДА STZ- 199 v9'6 OF ZZ 066 6012 38183 
6820 ого #01 600- є 0 РЕО 9c0 oro 909 “цоор pejoog 
7042 
100'0:> 100'0:> то 100'0> 100'0:> 10070» 80070 РО . en[a-d 
9[9 £F'6I ITI- сс9 896. 06791 СТЕ 020- 77, 
10002» 100'0:> 09070 10070» 100702 100'0:> 100'0:> €0r'0 anyea-d 
OF ZI 87 6F L8'1— €6'0C ГАКА 0Є`8© Spo ссо- IZ 
10090» 10070» ито 1000» 10070» 100/0:» 1000» 1970 эгвА-4 
68v L6 61 860— 829 #26 Р8'61 68€ 010- 3618-1 WA 
$6270 го 780 £00— ZEO Тео 1650 То 000 "пао uva гвчорогб-5500) 
100'0> 10070» 80070 1000» 1000 1000» 51070 1000> onyea-d 
go's LS'SZ Аа 22.6 оби LULZ 8Г2 Brez 18154 
9920 ого 180 100- 670 6270 250 so'o род "дао регоод 
зр но — - 
гру і + - + + + + 4 Uonoppedq 























JAHILO Add ЧҮҮ ААЛУ ЧУУ 042 TOI 0 


(penuguoo) z WIAVL 


May 2011 


The Accounting Review 
American Accounting Association 


1065 


Inflation and Nominal Financial Reporting: Implications for Performance and Stock Prices 


(с = ч) 8002 pue PRG] 12944324 spus-eoÁ [OST шил pue "вової pue sure цоперш отегпогво оз e[qu[TeAw вер 
әлвц тар чеізпішод pue 4940 Jo чопозвзани эф ur әле те 542013 “501 OVASWN PU? ‘KAWY ‘ASAN Пе Sepnpour ojdures оці, 5259):-1 pe[rej-ouo woy рашејцо аш sjuortjooo 
peappeid-u8is 10) sonpsA-d "8916 шрво 41048 эц $50108 попвілор prepus;s pus пвәш оф oj тэрэ! Крэлцоэд5э1 (длоррія pue (дивәш aoga "Г(Т — А)! / (улеррів) / (1) вэш = 27, 
рив ‘541038 узпороов-євого јо Joquinu оф 81 М pue "шоргоці jo вээгдэр amp st ^y ‘f поцввохдаг [20295-53012 Joy 20511648-1 om 51 т азоцм (2 - 9) 917 Пк (МЕ 11715 
IZ 'onsmus- (2161) фонову рив шид оф 01 813721 1615-1 УЦ :S918UIRSO (800Ц095-650:0 9Ц) 880208 рофијолв YUsloyjoos VEAU ош 0) зараз UBIA 1018210345 [8001109$-5$040 
эш 0102) Te 19 вя ur sornsnpu: ст оці по peseq eje вогдварат (1000) Te 19 preg по peseq ae pre ү ара, ur ратијар am којавшна [onuoo ayy, "unppoSpe om Зори 
stseq pojsnípe-uonegur ue o) poyeisal =8отолвә peururou sp SUPLY шор Areurploerxg 2:0104 IWONY 81 сдинитритом “sosso] pue sures попео peztuBooarun samduo у раз 
‘Ambo jo эсцвл зажив рэв8в| әф Aq popop ‘хЗинирзцеипиок — 8чишочу| = 10] SNAR Sunersdo шод smog 4580 st 0412 (Кил зор рэпгпо are jet зојаеџваА Jojeorput 
Aysnpur Surpnpour pue 8п01882139: (ЕНОЦО28-850:0 penaus) тво pus шлу Aq 8:0лэ 0015592801 Jo 8шлојапјо-ојапор Sursn pojeumso 81 uonesogioods (peuonoos-sso:») pojood оці, 

ХРТ = 93 


Url + ATHLO 7 + мача Я Дуул! HANIN 9 +'МУУ 4 +'042 МОГ +0 51040 
жоцепбо Зшмопој эц) јо 810:558:8а: (810Ц228-880:02 pue ројоод Зопешцвә wog 830891 suodar 2148) eu 


(penupuoo) Z WIAVL 


May 2011 


American Accounting Association 


The Accounting Review 


Konchitchki 


890°0 
2100 
2000 
L00'0— 
1100- 
9100- 
0200-- 
900 0— 
$€0'0— 


08070-- 


TOY ово 


1066 


(2804 јхәи uo рапицио2) 


(100'0:>) 
РЕР 
(10070) 
9175 
(10070) 
689 
(100'0:>) 
1 
(100°0>) 
#85 
(10002) 
ЖА 
(100'0>) 


(1000>) | (100'0>) 
суб ОГТР 
(10002) (1000:-) 
9g'LI 60 EP 
(1000-) (10009) 
УТ 161% 
(100702) (100702) 
2971 89766 
(10092) | (10002) 
11:81 66:66 
(0002) (1000>) 
єс Ра Ор 
(100'0>) | (Т0002) 
€S°9Z 80`06 
(10002) | (10002) 
ZBET 96:6р 
(1000-) (10002) 
YUST 167 
(10002) (0005) 
ZIT vl OP 
(әпүвА-@) 
205р815-) 


(10002) 
ILL 
(100`0>) 
618 
(10002) 
988 


(20070) 
0Г6- 
(100) 
921- 
(є0/:0) 
8€0— 
(£160) 
по 
(901'0) 


(100°0>) 
89% 
(10002) 
oF 
(100'0:>) 
8Ly 
(100702) 
LEP 
—— 





L9L00°0 (01-1) эЗрэц 1802-0197, 

:УОМ4 PPY ‘GWA PPY 

928000 (01-1) обрац 1502-0197, 

‘VON ON ‘GWN PPV 

0¢800°0 (01-1) әЗрәц 1802-0197, 

15101984 youalj-eure 7 

9807 9600:0-- 01 

5601 61000- 6 

06011 Р000'0— 8 

2860 100070 L 

2860 110070 9 

рої 060070 С 

8001 190070 у 

2660 ZP00'0 € 

9860 990070 2 

€£0'1 6700`0 | 

TURKS do оцорлод TOI 
зато бо у 


PAI] оцорлод гнорешлодії поцерат 39150 шолу 5иүлшэн јешлопа у 


Е ЯУЛЯУТ 





Мау 2011 


The Accounting Review 
American Accounting Association 


1067 


Inflation and Nominal Financial Reporting: Implications for Performance and Stock Prices 


"(SZ = п) 8002 pue у861 изомјед 5рпә-тгәА [2930 циг pue ‘525501 pue SUF поперш оувупогво о} ојаврвав вувр элвц ув ‘зато 
рив 4540 ЈО VORLON оці ur әле Teu $42035 7670 оуабум риє ‘KAWY ‘ASAN НӨ spnu әјішез әд], 518911 рэпом Woy pourgjqo are sonpsa-d “оцор:04 YON MOJ 
галр эц) uo sumo Арриош эц) Jo OFBAU оці ваш воцодзой PON 9819 ear эф uo sumar Апроош эц) Jo эВелэле оф se puou цоез poeno пес 81 VONA "pue-IuoÁ oosy 
өф тэцв приош дәл Занадэад eÁ Juonbesqns эц) 1240 soropn1od xis AP uo sumgar рээ гэл-эгцвл Апрпош зувпогво потр | 754018 jg гәл ров VON 0/1 әр Jo Suonoes1ojut 
om шоу sonopuod xr Sunonnsuoo рив 'sdnoi8 (Wg) зохгиш-о)- доод oonp pus $41018 YON ом) ойн $0018А12540 [Te ш цупоці yous Яков зву Aq рэшюу ‘опорой ром 
Sapyorumur-10199j 8 817 VON `(у00@) Те 32 зэлэүцэлн ur рэдиовэр sz разопляцог 'sjosse (510) ров Ве Ay рәјырәр gyesse Зицегодо JIU st VON “SCRUM Ч8полр JERBAL заввувр 8101284 
ров зоподзод qouaig-8Um oq) шод 8:0)08) штупошош рив цопазд-кшву оф рив эта: 991]-3sH оці, "Su mjol Зицёпер 10) peisnfpe ore pus “эрд 30015 Аио 4540 em Woy 
рэш до әле sumer 42015 ма: АПППОЈАГ "9181 а Ammseary, шшош-әпо әш st “уу "ауві ӨЭЦ-3811 оці, 810108; ропэ4-рэтеү21 эщ uo “оцорцоа 13248 эц) Затиоця pue оподіой somo] 
эц) Surguo[ зә рэп до әле yor ‘ошто: оподлой эЗрэц зпоццязаш 1800-0197 8013591391 WO әле soropiod ззэмо| oq pue 15948 П] әң TVA HUP эф чо pojonpuoo 5189) 
1991513 рив орозтодито оці, "9148; oq ut ^o se рәнойәз are 51015591801 9524} шолу 81йәолә oq], 510108) (VONA) 839888 Sunerado зэв эц) рив (GWA) WMU (L661) злеціво) oq 
sjanuoo se #шррв Árenuenbos "ТИН ‘AWS «ІМІМИ) 810126] qouonrg-vum, оф по развојној ore SUNJA ssooxo oropiod Апош озор pus 'оподюа звід ut 50008412540 qe Fursa 
ш доош цово родајпојво eo “Уу - "у чилцој ввоохо оподіой обеголе ‘рицу, "895801 pue sures uonegur pezruSooerun samidao 11 pm Чрафва8) sHurumo peupuou mupu supaga 
pajsnfps-uonsgut яр 707 "Кпраош poouspeqa (021) ouo опорлод озш рәнов тоу (4594814) 15940] эш ЧУМА porred цово роуплиоо are зоподюд по *puooog "puo-neaÁ (єо8) IP Jays 
syuow aoi Зиипидод вү}пош 21 оці J9A0 птиці 83781011258 потувліоздо 1894-1121) Y YA Suossa оподіой Апрпош 59155-20 Sureurso шод 510591 510491 91481 оці, 


(penuguo») $ ЯТЯУТ, 


The Accounting Review 


May 2011 


American Accounting Association 


1068 Konchitchki 


Untabulated results from a trend regression of the intercepts on ordered portfolios indicate that the 
decline in intercepts across the ten portfolios is significant (р < 0.001), suggesting a significantly 
negative relation between inflation gains and subsequent abnormal returns. 

Table 4 reports results from the firm-level return analysis. The table presents the annual mean 
abnormal returns, accumulated over the year subsequent to portfolio formation and aggregated 
over the lowest and highest ІСІ, portfolios. Similar to the results from the portfolio-level analysis, 
the firm-level results show that inflation information generates significant abnormal returns, with 
Statistically significant mean hedge abnormal returns over the sample period that vary between 8.8 
percent and 9.8 percent per year across the four abnormal return metrics considered. The results 
also show that, consistent with the results in Table 3, the low portfolio consistently yields higher 
subsequent returns than the high portfolio, with positive abnormal returns to a hedge strategy in 22 
to 23 of the 25 sample years. 

Figure 2 plots the four abnormal return metrics accumulated over the year subsequent to 
portfolio formation, illustrating the dominance of the low portfolio (solid lines) over the high 
portfolio (dotted lines) on average in about 90 percent of the sample years. This reflects a time- 
consistent pattern in future abnormal returns. я 

Overall, the findings of significant future abnormal returns provide consistent evidence that 
inflation-adjusted information is not fully incorporated by the stock market. 


VI. INVESTOR PROCESSING OF INFLATION INFORMATION 

The association between current-period inflation gains (JGL,) and subsequent abnormal re- 
turns (AbnormalReturn,,,) depends on how investors estimate IGL,. Figure 1 illustrates that ІСІ, 
is estimated using information before AbnormalReturn,,, begin to accumulate. If investors cor- 
rectly estimate IGL, then there is nothing unexpected with respect to the future realization of 
inflation gains in cash flows, such that no future abnormal returns are predicted (and hence no 
association between ІСІ, and AbnormalReturn,,, is predicted). Alternatively, if investors com- 
pletely ignore inflation, rendering as unexpected the entire future realization of inflation gains in 
cash flows, then it leads to a future positive surprise when inflation gains are realized. Such a 
surprise when ignoring inflation gains is commensurate with /GL, because the higher the (ignored) 
inflation gains are, the more favorable are future cash flows relative to investors’ expectations, 
leading to a predictable positive association between IGL, and AbnormalReturn,,,. 

However, the previous section provides evidence of (1) the existence of future abnormal 
return, and (2) a negative association between IGL, and AbnormalReturn,,,. Thus, investors appear 
to neither correctly estimate nor completely ignore inflation gains. This. raises the question as to 
whether, in attempting to adjust for inflation, investors make errors in doing so. Accordingly, I 
examine how inflation information is processed by investors. Prior studies indicate that investors 
“fixate” on aggregate amounts without distinguishing components of the aggregate amounts. For 
example, Sloan (1996) provides evidence consistent with investors “fixating” on aggregate earn- 
ings, failing to distinguish the different implications of the accrual and cash flew components of 
earnings for future performance. Similarly, because inflation affects monetary and nonmonetary 
assets differently, stocks prices will be affected if investors rely on aggregate amounts instead of 
distinguishing their different components. I therefore investigate how investors process the impli- 
cations of inflation gains by examining whether “fixating” on aggregate amounts without distin- 


22 The second column in Table 4 provides the annual inflation rate, calculated based on the PRNEW index from the Global 
Insight (DRI) dataset’s Basic Economics Monthly Series. The inflation rate can be obtained from different data providers 
with slight differences, yet different measures are often highly correlated. For example, annual inflation rates over my 
sample period calculated from the Global Insight and World Bank datasets are very close to each other, with a high and 
significant correlation (correlation = 0.82). 
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FIGURE 2 
Abnormal Returns from Using Inflation Information: Firm Level 
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The figure plots the annual mean firm-level returns reported in Table 4. The returns are accumulated over the 
year subsequent to portfolio formation, for the high and low decile portfolios formed each year based on IGL, 
such that observations with the lowest (highest) /GL are sorted into portfolio one (ten). JGL із inflation-adjusted 
earnings minus nominal earnings (scaled), and it captures unrecognized inflation gains and losses. 


guishing the monetary and nonmonetary components can explain the abnormal returns findings. 
Distinguishing monetary and nonmonetary assets is critical in inflationary accounting because 
inflation affects the two classes of assets differently. 

I conduct two tests to investigate this question. In the first test, I begin by using the algorithm 
to approximate inflation-adjusted earnings estimated without distinguishing monetary and non- 
monetary assets (denoted as JAEarnings NoDistinguish), by multiplying all assets by a constant as 
if there were no difference between the two classes of assets.~ I then test for a systematic 
association between future returns and the difference between [AEarnings_NoDistinguish and 
IAEarnings—i.e., the inflation-adjusted earnings correctly estimated by distinguishing monetary 
and nonmonetary assets. Using the relationship between future returns and the expected surprise to 
infer how investors process information is similar to the approach used in prior studies (e.g., 
Bernard and Thomas 1990; Sloan 1996). 

Table 5 reports the results. Panel A provides inflation-adjusted earnings with and without 
monetary and nonmonetary assets distinguished across IGL portfolios. It shows that not distin- 
guishing monetary and nonmonetary assets leads to overestimation of inflation gains and losses (in 


23 Because of the clean surplus relation, this is equivalent to multiplying the income of all firms by the same constant, 
which is a function of the annual inflation rate, without taking into account differences in firms’ asset composition. 
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absolute value), such that JAEarnings are overestimated (underestimated) in the low (high) IGL 
portfolio, which results in an increase in the difference between [AEarnings and JAEarnings No- 
Distinguish across IGL portfolios. Panel B provides the future abnormal returns (ар) reported in 
Table 3 across ІСІ, portfolios. Strikingly, the patterns in JAEarnings — IAEarnings NoDistinguish 
(hereafter, JAEarnings — IAEarnings_NoDistinguish is referred to as DIFF) and а, across the 
portfolios reveals that the two patterns are closely related, with the highest (lowest) future abnor- 
mal return arising when JAEarnings is at its highest (lowest) compared to JAEarnings NoDistin- 
guish. Specifically, not distinguishing monetary and nonmonetary assets leads to the greatest 
underestimation of inflation-adjusted earnings relative to their correctly estimated value (DIFF — 
0.0231) in the lowest JGL portfolio, which is consistent with the return findings in that the lowest 
IGL portfolio has the highest and most significantly positive future abnormal return (a, 
=0.0049; p < 0.001). Thus, the results provide consistent evidence that the highest future abnor- 
mal return is obtained when ІСІ, is most underestimated—i.e., when the future realization of 
inflation gains in cash flows is not fully expected, reflecting a future positive surprise when 
inflation gains turn into higher-than-expected cash flows. Investigating the highest JGL portfolio 
reveals similar consistency between inflation processing and future abnormal returns, with the 
lowest differential (DIFF = --0.0432) associated with the lowest and most significantly negative 
future abnormal return (а, = –0.0036; р = 0.002). This consistency between the patterns in Panels 
A and B is also reflected in Panel C, which provides results from regressing о, on DIFF and an 
intercept that is omitted from the table for brevity. The results show a significantly positive 
coefficient on DIFF (coefficient = 0.1507; p = 0.001). 

In sum, the findings from this first test reveal that the error from not fully incorporating the 
effects of inflation is associated with the documented pattern in future abnormal returns. This test 
therefore provides evidence that the negative association documented between inflation gains and 
future abnormal returns is predictable and consistent with an inflation adjustment argument. 

In the second test, I use a direct rational expectations framework similar to that employed in 
Mishkin (1983), Ball and Bartov (1996), and Sloan (1996). Specifically, to examine how inflation 
information is reflected in stock prices, I estimate the following nonlinear weighted least-squares 
system of equations: 


CFO,,, = & + & «ІСІ, + 6) · ІСІ, NoDistinguish, + б) · СЕО, + 64 - ACCRUALS, + [,, Return 
= (СЕО, – ду — 8| - IGL,— & · ІСІ, NoDistinguish, ~ 5, · CFO, – 6, - ACCRUALS,) 
+ Инт, (4) 


where the accounting variables are as defined above, and Return, is the adjusted return accumu- 
lated over year 1+1 and beginning three months after the fiscal year-end of year t. To control for 
heteroscedasticity, the first equation of the above system is scaled by the ratio of the residual 
variances of the two equations, where each equation's residual variance is obtained from an 
estimation of the given equation alone. Following my findings that /GL—constructed by distin- 
guishing monetary and nonmonetary assets—predicts future CFO, I predict 6, (6) to be signifi- 
cantly positive (insignificant). If stock prices correctly reflect the implications of inflation gains for 
future cash flows, then I expect ô (6,) to be significantly positive (insignificant), with the 
difference between the respective values of these two coefficients not significant. A significant 
difference between the coefficients on the inflation variables from the two equations would be 
evidence of investors not fully incorporating the implications of inflation information. 

Table 6 reports the results. The table reveals that, as expected, б, is significantly positive 
(5, = 0.0815; р = 0.010), 6, is insignificant (5 = —0.0242; p = 0.289), and the difference between 
6, and 6 is significant (p = 0.044 for the test: & = б»), consistent with the ability of inflation 
gains and losses to predict future cash flows. However, the table also reveals that 6, is insignifi- 
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TABLE 6 
Stock Price Reaction to Information in Current Inflation Gains and Losses about Future 
Cash Flows 
СЕО, = & + 61 IGL,* 6 IGL NoDistinguish, + 6: СРО, + б, ACCRUALS, + б 
Return, = |(CFO,,, – ду - бу "ІСІ, ~ 6; - IGL_NoDistinguish, - бу 
- СЕО, - à; ACCRUALS, + шал 


Estimate Asymptotic Standard Error x^-statistic Asymptotic Pr > д2 


& 0.0587 0.0009 66.00 <0.001 
8, 0.0815 0.0315 2.58 0.010 
8, -0.0242 0.0228 -1.06 0.289 
6 0.6912. 0.0057 120.71 <0.001 
8, 0.0825 0.0056 14.76 «10.001 
т 0.3814 0.0130 29.24 <0.001 
Ч —0.2514 0.0131 — 19.21 <0.001 
8 —0.4178 0.2723 —1.53 0.125 
5; 1.6302 0.2044 7.98 <0.001 
0.7215 0.0493 14.62 <0.001 
à 0.6675 0.0521 12.81 «0.001 
Tests 
8, = à 4.05 0.044 
8-6 20.01 «0,001 
8-0 10.27 0.001 
6-4 64.71 <0.001 


The table reports results from estimating a nonlinear weighted least squares system of equations to examine how inflation 
information is reflected in stock prices. CFO, is cash flows from operating activities for year 141, extracted directly from 
the statement of cash flows or derived from changes in balance sheet accounts as explained in Table 1. ACCRUALS, is total 
operating accruals, calculated as ACCRUALS, = NominalEarnings,- CFO, where NominalEamings is Income before Ex- 
traordinary Items as reported in the financial statements (Compustat: IB). ІСІ, captures unrecognized inflation gains and 
losses and is calculated as ІСІ, = [AEarnings — NominalEarnings, where IAEarnings is nominal earnings restated to an 
inflation-adjusted basis using the algorithm in a manner that distinguishes monetary and nonmonetary assets; this is the 
inflation gains and losses measure used throughout the study. /GL, NoDistinguish approximates unrecognized inflation 
gains and losses without distinguishing monetary and nonmonetary assets and is calculated as IGL_NoDistinguish 
= lAEarnings, NoDistinguish — NominalEarnings, where [AEarnings_NoDistinguish refers to inflation-adjusted earnings 
obtained using the algorithm without distinguishing monetary and nonmonetary assets. All accounting variables are scaled. 
Return, is adjusted stock return for year (4-1, accumulated over the year subsequent to year t and beginning three months 
after the fiscal year-end of year t. Raw stock returns are from the CRSP Monthly Stock File, adjusted for delisting returns. 
To control for heteroscedasticity, the first equation of the system is scaled by a scaling factor computed as the ratio of the 
residual variances of the two equations, where each equation's residual variance is obtained from estimating the equation 
alone. p-values are obtained from asymptotic Chi-squared tests. The sample includes all NYSE, AMEX, and NASDAQ 
U.S. stocks that are in the intersection of CRSP and Compustat, that have data available to calculate inflation gains and 
losses, and with fiscal year-ends between 1984 and 2008 (n = 25). 
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cant (ài 204178; р = 0. 125), & is significantly positive (6 = = 1.6302; р < 0. 001), the differ- 
ence between ô, and & is significant (р < 0.001 for the test: ô; = 85), and the differences in the 
coefficients on the inflation variables across the equations are significant (p — 0.001 for the test: 
ô =ô; р < 0.001 for the test: & = &). Taken together, the findings from the second test are 
consistent with investors not fully incorporating the implications of /GL for future CFO, and with 
investors fixating on inflation gains and losses without distinguishing monetary and nonmonetary 
assets. 

In sum, the two tests above use two different büt complementary approaches to investigate 
how investors process inflation information in the cross-section of firms. The findings from these 
tests provide consistent evidence that the negative return pattern documented across portfolios 
ofinflation gains is consistent with an inflation adjustment argument, whereby investors do not 
distinguish monetary and nonmonetary assets. З 


VIL. ROBUSTNESS TESTS 
Are the Abnormal Returns Attributable to Risk? 


The return analyses provide evidence of mispricing. However, although the abnormal returns 
can be attributable to mispricing from inflation-adjusted information being costly to obtain and 
process, they may also be attributable to an omitted inflation-based risk factor. To verify the source 
of the abnormal returns, I first compare risk characteristics across IGL portfolios, and find that 
there is no pattern in risk characteristics across the portfolios.” I also conduct a two-step Fama- 
MacBeth test, which examines whether a risk factor is priced. This test uses an inflation-based 
factor and analyzes whether there is a positive risk premium to this factor in a cross-sectional. 
regression analysis (Fama and MacBeth 1973; Fama and French 1992). 

In the first step, I estimate time-series regressions at the portfolio level, based on the 25 
portfolios constructed on MVE and BTM (Fama and French 1993): Р 


Крм 7 Крт = №, + Bp мктвељ` МКТКЕ + В, supo: $МВ,, + Bonum: HML, 
+ By,umpm* ЧМО, + Вр вст ` FIGLg + Kms (5) 


where FIGL is a monthly factor-mimicking inflation information portfolio that I form based on 
IGL information in a similar way to how FNOA is constructed in Section V, following the 
procedure Fama and French (1993) use in forming HML and SMB. I focus on the 25 Fama-French 
portfolios because explaining their cross-sectional pattern in returns has attracted increasing inter- 
est in the literature (e.g., Lettau and Ludvigson 2001). 26 | use the time-series portfolio regressions 
to obtain the predicted factor loadings (betas) for each of the 25 portfolios, estimated using 


24 Note that there may be other channels that contribute to the negative association between ІСІ, and future returns. For 
instance, investors may apply different rules to relate JGL to future cash flows and, thus, although future cash flows are 
higher when inflation gains are higher (Table 2), growth in realized cash flows could be lower than expected. In this case 
there would be a positive association between 101, and future cash flows but a negative relation between ІСІ, and 
abnormal returns. Alternatively, investors may naively suppose that /GL turns into future cash flows in a one-to-one 
manner over some finite horizon, ignoring potentially important differences in the investment process associated with 
inflation gains. In this case inflation gains may generate future cash flows that are lower than what investors expect, 
which would lead to negative future returns. In this paper it is difficult to empirically investigate such channels without 
either a more structural model of the firms’ investment opportunities or more finely disaggregated cash flow data that 
provide details about the source of each firm's cash flows from operating and investment activities; making several 
additional assumptions would make my inflation measure noisy at best. Importantly, the two tests I use to infer how 
investors process inflation information are similar to those used in several prior studies. 

For example, the median 8, verge (B, sup; Вінмі) Віумо) varies between 1.00 (0.59; 0.11; —0.096) and 1.05 (0.68; 0.46; 
—0.13), respectively, with no trend across ІСІ, portfolios. 

26 More information about the 25 Fama-French portfolios (constructed on five MVE and five BTM portfolios) is at 

http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html. 
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five-year rolling windows that end at month m, with the requirement of at least ten portfolio-month 
observations in each window. 

In the second step, I estimate cross-portfolio monthly regressions of portfolio excess returns 
for month m+1 on the predicted rolling betas, such that each regression pools predicted betas for 
the 25 portfolios, as follows: 


Ro met Бут = Фън + УмкТЕЕ,т+1 ` Бр,мкткЕт + YSMB,m+1 ` Врдмв + YHMLm+1 7 Во,нмї,т 
+ Урмр,т+1 ` Bp, UMD,m + YFIGLm+1 ` Pp FIGLm + Сињ» (6) 


where Bp, MKTRF m Bp, SMBm Bp, HMLm: В, имот and Dp. FIGLm are the predicted portfolio betas 
estimated in the first step using data conditioned on month m. I aggregate and conduct tests on the 
estimates following the Fama and MacBeth (1973) procedure. 

Table 7 reports the results. With respect to the Fama-French factors, the results are consistent 
with prior research, such as Petkova (2006) and Core et al. (2008). The mean estimated coeffi- 
cients (p-values associated with the Fama-MacBeth t-statistics) on the market, size, book-to- 
market, and momentum factors equal —0.0058, 0.0004, 0.0038, and 0.0001 (0.122, 0.857, and 
0.033), respectively. Turning to the coefficient of interest, the mean estimated coefficient on 


Êp,FICLm across the monthly cross-sectional regressions, 1.е., Урум, 18 0.0015 with a p-value 
of 0.318. The insignificance of the coefficient associated with the inflation gains factor, Унгар» 
is inconsistent with an omitted risk factor associated with inflation. Rather, this result is consistent 
with abnormal returns being attributable to mispricing. Such mispricing does not necessarily imply 
market inefficiency, however, because inflation-adjusted information 1 is potentially costly to obtain 
and process. 


Possible Confounding Factors 

In additional robustness tests, I check whether the abnormal return findings can be explained 
by factors that are known to explain returns in the cross-section. First, nominal leverage and 
nominal earnings-to-price ratio (E/P) are known to explain average returns. Although all of the 
return models in this study include as a control the size and book-to-market factors that are shown 
to absorb the roles of nominal leverage and E/P in explaining average returns (Fama and French 
1992), I analyze the pattern in leverage and E/P across the JGL portfolios. I find that the respective 
median leverage and E/P are 0.6 and 0.04 for portfolio one and 0.61 and 0.05 for portfolio ten, 
with small variation and no systematic pattern in these ratios across portfolios. I also find small 
variation and no systematic pattern in past returns across ће IGL portfolios, with a median past 
12-month return of 0.10 and 0.12 for portfolios one and ten, respectively. Second, to examine 
whether industry concentration helps explain the results from the return tests, I examine the 
percentage of observations from each of the 15 industries in each of the ten inflation gains 
portfolios. The results suggest that no specific industry dominates, with industry concentration 
varying from 0.7 percent to 27.3 percent in the most extreme cases across the portfolios. Third, I 
examine the possible effect of industry-related omitted variables on the positive association be- 
tween inflation gains and future cash flows by allowing not only the regression constant in Equa- 
tion (1) to vary by industry, but also the other coefficients. To do so, I estimate Equation (1) by 
industry (including year fixed effects), and summarize the industry cross-sectional estimates using 
Fama and MacBeth (1973) t-statistics. The results reveal no change in inferences regarding the 
significantly positive association between inflation gains and future CFO, with cross-industry 
mean coefficients equal 0.05579, 0.04341, 0.0348, and 0.07375 for the one- through four-year- 
ahead horizon, respectively. 

Fourth, I further examine the robustness of the negative pattern in the return results in Tables 
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TABLE 7 


Two-Step Fama-MacBeth Procedure: Results from Second Step 
Monthly Cross-Portfolio Regressions 


Ry mii 7 Ејми = Фин + YukrRE Ma Bp METRE, m t ҮзМв,я+1` Во,5мв,а + YHML тај 


* Вь,нмт.м + YUMD,m+1 * Bo, имо + Увібізтча * Вриа њи + Om 


: Fama-MacBeth 

Prediction Mean Std. Dev. Standard Error t-statistic p-value 
Фи 0.0125 0.0561 0.0036 3.51 <0.001 
YMETRE m+! —0.0058 0.0592 0.0038 -1.55 0.122 
Узмвтні 0.0004 0.0345 0.0022 0.18 0.857 
YHML mH + 0.0038 0.0321 0.0020 1.85 0.033 
YuMD ml 0.0001 0.0611 0.0039 0.03 0.976 
Унсынн ? -00015 0.0243 0.0015 —1.00 0.318 
Adj. R? 0.510 


The table presents summary results from monthly cross-portfolio regressions in the second step of a two-step Fama- 
MacBeth (1973) procedure. In the first stage, portfolio time-series regressions are estimated using the 25 Fama-French 
portfolios (Fama and French 1993). Value-weighted portfolio excess return for month m for each portfolio p, Ry m~ Ку» 
is regressed on the contemporaneous monthly returns for the Fama-French (MKTRF, SMB, HML), Carhart (1997) momen- 
tum (UMD), and inflation-based (FIGL) factors, using five-year rolling windows that end at month m (requiring at least ten 
portfolio-month observations in each window), as follows: 

Rpa Кра = Ap + By METRE E MKTRF,, + Pp SBm 1 SMB, + Врнмтт У НМЕ, + В.џирт 1 UMD a Вт 1 FIGL, 


+ Ky. This results in five predicted rolling betas as of month т. In the second step, monthly cross-portfolio regres- 
sions of portfolio excess returns for month m+1, Краља — Ку» are regressed on the predicted rolling betas from the first 
step, such that each regression pools predicted betas for the 25 portfolios estimated in the first step using data conditioned 
on month m. The estimates are aggregated and tested following the Fama and MacBeth (1973) procedure. Fama-French 
and momentum factors are from the Fama-French Portfolios and Factors dataset available through WRDS. Fama-French 25 
portfolio excess returns are from Ken French’s website. Rj is the one-month Treasury bill rate. The inflation-based factor, 
FIGL, is formed following Fama and Freach (1993), using the same procedure I use to form FNOA, as described in Table 
3. p-values are obteined from one-tailed (two-tailed) t-tests when there is (no) prediction. The sample includes all NYSE, 
AMEX, and NASDAQ U.S. stocks that are in the intersection of CRSP and Compustat, that have data available to calculate 

inflation gains and losses, and with fiscal year-ends between 1984 and 2008 (n = 25). 


3-5 Бу testing whether it is /GL and not the balance sheet bloat that causes the return distribution 
I find across IGL portfolios. Specifically, I sort observations into five NOA and five IGL portfolios 
(independent sorts) and obtain abnormal returns for the five-by-five, double-sorted portfolios. To 
assess the pattern across the portfolios, I also estimate a predicted trend regression model by 
regressing abnormal return on the JGL portfolio number (and an intercept). Untabulated results 
reveal that the negative return pattern across /GL portfolios persists in each of the NOA portfolios. 
The trend, which is the slope coefficient from the trend regression, is negative in all NOA port- 
folios (with a trend varies between —0.0056 and —0.0372 across the four abnormal return metrics 
Luse), and significantly so in all cases except three, where the trend is marginally significant (with 
a p-value varies between 0.001 and 0.104 across the related four abnormal return metrics). These 
results provide additional evidence that the return distribution in Tables 3-5 is not caused by the 
balance sheet bloat. I also conduct an analysis wherein I repeat the return tests in Tables 3 and 4 
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after constructing FNOA as explained in Section III, but using ten NOA portfolics (instead of two 
portfolios as in Section III). Thus, FNOA is constructed based on the intersection of three BTM 
portfolios and the top and bottom deciles of МОА. This is because most of the negative return 
documented in the balance sheet bloat is concentrated in the top one to two deciles of NOA. 
Untabulated inferences from this analysis are unchanged relative to the findings in Tables 3 and 4. 
For example, the hedge return in Table 4 is positive in 21 of the 25 sample years, with a mean of 
0.097 and p < 0.001.” 


УШ. CONCLUDING REMARKS 

I hypothesize and find that inflation information in the current period affects future cash flows. 
Specifically, I find that unrecognized inflation gains turn into future cash flows from operations 
over the subsequent four years. I also find that investors do not fully incorporate such information 
into their investment decisions, in that I ünd significant abnormal returns for inflation-based 
trading strategies. I further find that the direction of the abnormal returns is consistent with 
investors not fully distinguishing monetary and nonmonetary assets. Robustness tests show that 
the documented abnormal returns are not attributable to differences in risk characteristics, and that 
an inflation-based factor is not a priced risk factor. These results are consistent with future abnor- 
mal returns being attributable to mispricing from costly information rather than to an omitted risk 
factor. Overall, the findings suggest that inflation has significant implications for performance and 
Stock prices, even when inflation is relatively low. 

My findings complement prior studies that primarily investigate stock pricing effects over a 
short event window or the contemporaneous year. À common theme from these studies is that 
inflation-adjusted data are inconsequential for making financial decisions. Watts and Zimmerman 
(1980) and Beaver et al. (1983) suggest that a potential interpretation of this conclusion is that 
Stock prices are not affected by inflation-adjusted data because such information is new, and 
market participants have not yet learned how to analyze and process it (the “learning effect"). 
Also, Beaver and Landsman (1983) reason that the inflation-adjusted data are potentially too 
complex and unfamiliar to use. My study contributes to this literature by indicating that (1) 
investors do not appear to ignore inflation-adjusted data, which suggests that the learning effect 
has been partially realized over the past decades, and (2) investors do not fully incorporate 
inflation information, consistent with such information being complex and unfamiliar. Further, this 
study is the first to identify the mechanism that leads to the mispricing by linking stock returns to 
the underlying valuation fundamentals and revealing how inflation-adjusted information is pro- 
cessed by the stock market. 


21 1 conduct two additional sensitivity tests. First, I estimate a model that pools the ten /GL portfolios and allows the 
intercepts for the extreme portfolios to vary using indicator variables. Second, I examine whether the firm-level return 
tests hold when using median abnormal returns. Untabulated results reveal unchanged inferences. Also, this study's 
return findings are distinct from, and cannot be explained by, inflation illusion. The inflation illusion hypothesis 
(Modigliani and Cohn 1979) posits that highly levered firms are more undervalued because of investors' failure to 
incorporate the gain accruing from purchasing power depreciation of nominal liabilities. Because the erosion of nominal 
liabilities leads to higher inflation gains, the direct eifect from the inflation illusion hypothesis is higher (lower) future 
abnormal returns when inflation gains are high (low) as investors who suffer from inflation illusion are positively 
(negatively) surprised over future periods. However, despite this offsetting effect of the inflation illusion on the findings 
from the return analyses, my results are incremental to the inflation illusion effect in that I find that future abnormal 
returns are negatively related to inflation gains (see Section V). The inflation illusion hypothesis also posits that 
investors irrationally discount inflation-adjusted cash flows using nominal interest rates. Here, in contrast, I investigate 
how inflation directly affects cash flows, rather than how the cash flows are discounted. 


The Accounting Review May 2011 
American Accounting Association 


Inflation and Nominal Financial Reporting: Implications for Performance and Stock Prices 1079 


APPENDIX 
The appendix develops and validates an algorithm for incorporating inflationary effects into 
accounting amounts, using only publicly available information, by adjusting nominal to inflation- 
adjusted amounts on a firm-by-firm basis for a broad sample of firms.”8 


Inflation Adjustment Algorithm 
Financial statements can be restated using the balance sheet or the income statement.” I rely 
on the balance sheet to adjust the nominal financial statements." 


A. Step I: Adjustment of Nonmonetary Items 

Nonmonetary items are linked to the dollar as of the year-end, but represent either a historical 
cost or a right (obligation) to receive (deliver) services for which purchasing power is not constant. 
I adjust these items as follows: 

A.1. PPE: I use the PPE life cycle to adjust PPE. An asset's useful life is the period over 
which the entity expects to consume economic benefits from the asset. Assuming that accounting 
depreciation, on average, reflects an asset's useful life, the PPE life cycle is the average number 
of years from the asset's purchase until it is fully depreciated. I thus calculate the PPE 
life cycle as: PPELifeCycle, = (1/ п) + Enl GrossPPE! PPE Depreciation], averaged over 
the four years prior to year-end t(n = 4). Next, I adjust Net PPE as follows: adjNetPPE, 
= NetPPE,: CPI,/ CPI, ца, where adj refers to “adjusted;” t refers to the year t fiscal year-end; 
Т) is the period prior to fiscal year-end t, stated in annual terms and calculated as 
тт) = 0.5 · PPELifeCycle,, and CPI denotes the Consumer Price Index.?! If PPELifeCycle is nega- 
tive, missing, or greater than the Compustat median limit of weighted expected useful life among 
different asset classes, which is calculated based on the expected maximum useful life of different 
PPE classes (e.g., U.S. Regulation 2003) varying between 20 years (e.g., Machinery and Equip- 
ment) and 50 years (e.g., Other Structures and Facilities), then I set it to the median life cycle 
calculated using the Compustat population over the sample period.” 


28 An extended Appendix that includes further information, rationale, and illustrative details regarding the algorithm is 
available from the author upon request. 

29 The clean surplus relation makes the two approaches equivalent. This is because the income statement approach derives 
inflation-adjusted income before financing expenses by adjusting income statement amounts, whereas the balance sheet 
approach first calculates inflation-adjusted earnings using two successive balance sheets and then calculates inflation- 
adjusted financing expenses as the difference between net earnings and income before financing expenses. Inflation- 
adjusted financing expenses are the same if derived using the balance sheet or the income statement, resulting in same 
inflation-adjusted earnings under the two approaches. 

99 This is because (1) it avoids mistakes inherent in deriving JAEarnings directly from the income statement, (2) it is more 

accurate because all transaction dates and income statement amounts are not necessary, and (3) because I focus on 

inflation-adjusted earnings, rather than inflation-adjusted revenues or gross profit, I can bypass reliance on further 
assumptions necessary to adjust the income statements (e.g., the timing af revenues over the year). 

I multiply PPELifeCycle by one-half because the life cycle is derived from gross, rather than net, PPE so the expected 

remaining useful life is one-half the gross PPE life cycle. Information about the exact transaction dates and amounts 

over the life of the firm is unavailable. Such information could help in estimating the exact purchasing date of each 
component of PPE and adjust it based on the associated vintage's purchasing power. Instead, I make a simplifying 
assumption that the PPE in place is acquired evenly over its life with the firm. That is, I adjust PPE using one-half of 
the Gross PPE life cycle such that the expected value of the remaining useful life is one-half of the life cycle obtained 
from Gross PPE. Also, note that because the adjustment is accurate to the monthly level, whereas г refers to annual 
amounts, 7 is often a fraction (e.g., for an estimated purchase date of six months prior to fiscal year-end t, 7 = 0.5 and 

NetPPE, is adjusted using CPI,/ CPI, 0 5). 

Note that there can be alternative adjustment procedures depending on the assumptions used and the objectives under- 

lying the adjustment. My objectives are to: (1) ensure consistency with actual inflationary GAAP; (2) obtain a sample 

of firms for which Compustat does not necessarily have available adjustment parameters (e.g., inventory and deprecation 
methods); and (3) develop a procedure that can be validated on firms in another country. For example, Davidson et al. 

(1976) requires data on the depreciation method. Requiring data about the inventory and depreciation methods would 

reduce my sample considerably, because such U.S. data are unavailable for about 40 percent of the observations. 
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А.2. Inventory: I use the inventory turnover, ГТ, ratio to adjust inventory, using the ratio 
COGS/Inventory, where COGS is the Cost of Goods Sold. Year-end t inventory turnover is cal- 
culated as: IT, = СОСУ, / [ (INV, + INV, ,) / 2). If, e.g., IT, = 2, the firm invests in inventory twice a 
year so the average inventory is six months old. In expectation, year ¢ inventory will have a 
remaining life of 12/ (2. IT) when stated in months, or «(t)=1/(2-JT,) when stated in years. 
Thus, I adjust inventory as follows: adjINV, = У, - CPI,/ CPI, „д. If COGS or INV are missing 
or negative, ГГ is set to the median IT of the Compustat population over the sample period. 

А.З. Intangibles: І calculate the intangibles’ remaining life for time 4, denoted as оҳу), as the 
ratio of intangibles to the amortization of the related intangibles at time г. I assume that, in 
expectation, the number of years prior to the transaction generating the intangibles equals the 
remaining years until the amount of intangibles is fully reserved, and thus I adjust intangibles 
using the price index as of the expected value of the original transaction date, or adjIntangibles,— 
Intangibles, - СРІ,/СРІ, ху: І set intangibles’ remaining life to the median remaining life of 
intangibles for the Compustat population over the sample period if it is negative, missing, or 
greater than firms’ common weighted useful life of different intangibles classes, which is calcu- 
lated based on the useful life of different intangibles classes varying between 2 and 40 years (e.g., 
patents) and between 20 and 40 years (e.g., goodwill). Also, according to SFAS 142 (effective in 
2002), goodwill and other intangible assets no longer have a defined life for amortization but 
instead are tested annually for impairment. Because the algorithm uses amortization based on the 
pre-SFAS 141/142 period, it uses parameters obtained from the Compustat population to adjust the 
years that follow. I repeat all analyses without amortizing the years subsequent to 2002, and the 
inferences are unchanged. 

A.4. Common Stock, Preferred Stock, and Capital Surplus: These items, which are included 
in shareholders' equity and represent purchasing power as of the stock issue dates, consist of two 
layers: (1) all stock issues from a firm's establishment through 1-1, and (2) new equity issues 
occurring in year г (this layer can include several sub-layers, one from every equity issue that 
occurred over the year). I assume that equity issues are distributed uniformly over the year. To 
state amounts in constant dollars as of the reporting date, I begin by adjusting the first layer to 
derive retained earnings for both year t-1 and year t. In constant dollars as of t year-end, the 
adjusted amount of the first layer іп t-1 is equal to the amount in f for calculating year / adjusted 
earnings. Using this two-layer process allows one to adjust earnings without having information 
about all the preferred and common stock issue dates and amounts from firms' incorporation dates 
until 7-1. Thus, the following amount, which corresponds to the first layer and provides 1-1 
equity, appears in any two consecutive retained earnings and is used to extract inflation-adjusted 
earnings: adjE, ү =[CommonStock + PreferredStock + CapitalSurplus],.; - СРІ,/ CPI, 4. For the 
second layer, І obtain adjusted new issues during the year, AdjNewlssues, by calculating 
new issues, MNewlssues,- | Соттопбіоск + PreferredStock + CapitalSurplus], — | CommonStock 
+ PreferredStock + CapitalSurplus], 1, and adjusting this amount using one-half year's change in 
CPI, under the assumption that new issues occur uniformly throughout the year. 

A.5. Other Monetary Items in Stockholders’ Equity but not in Retained Earnings (О): Because 
earnings are obtained from the difference in retained earnings between two successive periods 
(adjusted for dividends and capital changes), it is necessary to exclude items that violate the clean 
surplus relation (e.g., Employee Benefit Trust) from inflation-adjusted retained earnings. This 
component is assumed to be monetary and is calculated as О, = TotalAssets, — TotalLiabilities, 
~ ReExOCI, ~ CommonStock, ~ PreferredStock, - CapitalSurplus, where ReExOCI, is рег А.б 
below. 

А.б. Retained Earnings Excluding Other Comprehensive Income (ReExOCI): It is critical to 
maintain the clean surplus relation when deriving earnings. Accordingly, I obtain nominal and 
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inflation-adjusted Retained Earnings Excluding Other Comprehensive Income. The inflation- 
adjusted amount is required because JAEarnings is derived using the two-period difference in 
inflation-adjusted ReExOCI. The nominal amount is used to derive O (per A.5.) as follows: 
ReExOCI = Retained Earnings (Compustat: RE) — Accumulated Other Comprehensive Income 
(Compustat: ACOMINC) The inflation-adjusted ReExOCI as of year t, adjReExOCl,, is 
derived by using the relation that total assets equal total liabilities plus shareholders' equity, and 
by stating all balance sheets amounts in constant dollars, where monetary (nonmonetary) items 
are not (are) adjusted: adjReExOCI,= adjINV,+ adjNetPPE, + adjIntangibles, + OA, — adjE, у 
~ adjNewlssues, — О, — Totalliabilities, (where, as ^ above, adjE,_,=[CommonStock+ 
PreferredStock+ Capital Surplus],_, - CPI,/CPI, |). Total liabilities are treated as monetary. I treat 
as monetary other assets (OA) that are not directly adjusted, and derive them as a residual value, 
using the relation that total assets equal total liabilities plus shareholders’ equity, as follows: 
OA, = TotalAssets, ~ INV, — NetPPE, — Intangibles,. 

A.7. Other Comprehensive Income and Other Items Affecting Retained Earnings without 
Directly Affecting Net Income (OtherInReExOCI): This item is used in the equation that derives 
IAEarnings. Two types of exclusions are subtle, yet necessary for the accounting identities to hold 
and thus for the accuracy of the algorithm. First, because JAEarnings is obtained using the two- 
period difference in adjReExOCI, dividends must be included in the adjustment. Second, all 
transactions that are neither part of Other Comprehensive Income nor part of Net Income need to 
be excluded (e.g., Net Issues of Common Stock under Employee Plans; Purchases and Sales of 
Treasury Stocks under Employee Plans). Because these exclusions are the result of transactions 
occurring at the year-end, I treat them as monetary. These amounts are calculated as: 
OtherInReExOCI, = ReExOCI, - ReExOCI, 4 ~ NetIncome,+ CommonDividends, 

4 PreferredDividends,. 

A.8. Dividends: Because dividends are usually paid quarterly, the adjusted common and 
preferred dividends, adjCommonDividends and adjPreferredDividends, are adjusted assuming 
these payments are distributed uniformly over the year. 


B. Step 2: Treatment of Monetary Items 

Monetary assets and liabilities are measured on the basis of a fixed number of dollars required 
for their settlement. Thus, nominal monetary amounts are already stated in terms of constant 
purchasing power and, accordingly, I treat monetary items as equal to their recognized nominal 
amounts. The following are considered monetary: Cash, Short-Term Investments, Total Receiv- 
ables, Total Liabilities, and assets not directly treated as nonmonetary assets (OA). The inclusion 
of OA implicitly treats unconsolidated but wholly-owned subsidiaries as monetary, consistent with 
Bernard and Hayn (1986). 


C. Final Step: Derivation of Inflation-Adjusted Earnings 
Inflation-adjusted earnings, JAEarnings, are calculated as follows: 


IAEarnings, = [adjReExOCI, - adjReExOCI, || + adjCommonDividends, 
+ adjPreferredDividends, ~ OtherInReExOCI, ~ adjExtraordinaryltems,. 


I obtain adjReExOCI, | analogously to adjReExOCI, (see А.б. above), except that in this case 
(1) I adjust the accounting amounts reported for year t—1 to the purchasing power as of t year-end, 
and (2) I do not subtract adjNewlIssues, у because it is already part of adjE, | as the new issues 
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during 1-1 are part of the 1—1 equity amount.” To reduce measurement error from the adjustment 
procedure, I delete observations each year in the top and bottom percentiles of the MVE, | deflated 
difference between JAEarnings and NominalEarnings. Because I investigate the behavior of 
IAEarnings versus NominalEarnings and because NominalEarnings refers to Net Income Exclud- 
ing Extraordinary Items, I exclude extraordinary items when deriving JAEarnings to make the two 
earnings measures comparable. 1 assume that extraordinary items, if any occur, are distributed 
uniformly over the year and thus are adjusted using one-half year's change in the price index; 
these items are denoted as adjExtraordinaryltems, ^ 


External Validation of the Algorithm 


To provide evidence on the external validity of the algorithm, I test the algorithm on a sample 
of Israeli firms. Until 2003 Israeli firms were required to recognize financial statements in 
inflation-adjusted terms and disclose in footnotes the same financial statements in nominal terms, 
and similar to the U.S., the inflation rate in Israel over the past decade was relatively low. In the 
validation analysis, I examine the extent to which nominal earnings derived by the algorithm, 
NominalEarnings!!, approximates disclosed nominal earnings, Мотта аттр “У, by esti- 
mating the equation: NominalEarnings”°“! = а + В. NominalEarnings^^"*! + e. If the algorithm 
does a good job translating earnings from one measurement basis into the other, I predict the 
intercept to be equal to 0 and the slope to be equal to 1. Thus, I conduct the tests: Но: a=0 
against НТ: a 0, and НВ: В-1 against He: В # 1. To do so, I hand-collect data from Israeli 
firms’ annual nominal and inflation-adjusted financial statements over the 1995-2003 period for 81 
randomly selected firms listed on the Tel-Aviv 100 index. This index comprises the 100 firms with 
the highest MVE and accounts for more than 80 percent of the total market's capitalization. The 81 
firms that I sample account for 86.63 percent of this index's total market capitalization as of 
December 21, 2005. 

After implementing the algorithm and requiring the same restrictions as with the U.S. data, 
the inflation-adjusted Israeli sample includes 503 firm-year observations. Also, because footnotes 
are not always attached to the financial statements, causing nominal footnote disclosures to not 
always be available, I randomly select 50 firms and gather nominal information, when such 
footnotes are available. Monthly CPI and exchange rate data are obtained from the Israeli Central 
Bureau of Statistics. The Israeli sample reflects a median firm size of $220 million. The mean and 
median values of the difference between actual (ie., reported) inflation-adjusted earnings and 


33 Specifically, adjReExOCI, | = adjINV, | + adjNetPPE,., + adjlntangibles, | + adjOA, , — adjE,., — adjO,., 


— adjlotalliabilities, ү, where: adjINV, , = INV,., © СРЇ/СРЇ үүр adjNetPPE, у = NetPPE,_, 
CPI, ICPI,..,.. иу adjIntangibles, у = Intangibles, " CPI, /CPI, A. quy adjOA._, = OA,_, · CPI,/ICPI,_,; adjO,.., 
= Oy CPLICPIL ,; as above, adjE, = [CommonStock+ PreferredStock+CapitalSurplus],, + CPI,/CPI,..,; 


adjTotalLiabilities, ., = TotalLiabilities, у - CPI,/CPI,_,; and w(t—1), 1-1), and w(t—1) refer to the period (stated in 
years) from which the lagged nonmonetary assets INV, NetPPE, and Intangibles, respectively, are adjusted. 

* With respect to the derivation of ЈС, there is a normalization based on a reference point underlying the adjustment 
procedure, Specifically, accounting amounts can be adjusted to be stated based on either constant dollars to maintain 
transactions in purchasing power, or current dollars to maintain transactions in consumption units. In the cross-section, 
the variation in ІСІ, rather than its level, is informative for explaining variation across firms, and the two approaches are 
equivalent when intercepts are added to the tests. I choose to adjust for constant dollers, leading IGL to be more 
frequently negative. Alternatively, ГОТ, can be adjusted such that it is more frequently positive but the variation across 
firms and over time is unchanged. Accordingly, if the prediction model is CF; =a + b- IGL +X, + 7,4, where X is a 
vector of additional explanatory variables (conditioned on the time г information set), analyses throughout the study 
pertain to the parameter b, which is invariant to the reference point underlying the measurement system. The intercept, 
a, varies with the measurement system but is not a parameter of interest in my prediction analyses. Accordingly, the 
research design throughout my study includes intercepts in all cross-sectional tests and focuses on the coefficient on 
ІСІ. 
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nominal earnings, JGLA™, are —0.02 and —0.01, with a standard deviation of 0.07. This suggests 
a difference of about 1 to 2 percent of firms’ size, with large variation between the two measures. 

The results reveal that the null hypotheses of о = 0 (р = 0.609) and В = 1 (p = 0240) 
cannot be rejected, with point estimates of а = 0.01 and В = 0.8.36 Overall, although the 
adjustment procedure does not use data about the timing and amounts of all of the firms’ trans- 
actions over the life of the firms until the reporting date (which are needed for complete inflation 
adjustment), the findings reveal that the algorithm provides a reasonable and unbiased proxy for 
the effects of inflation. 
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ABSTRACT: This study examines the effect of the readability of firms’ written commu- 
nication on the behavior of sell-side financial analysts. Using a measure of the read- 
ability of corporate 10-K filings, we document that analyst following, the amount of effort 
Incurred to generate thelr reports, and the informativeness of their reports are greater 
for firms with less readable 10-Ks. Additionally, we find that less readable 10-Ks are 
associated with greater dispersion, lower accuracy, and greater overall uncertainty In 
analyst earnings forecasts. Overall, our results are consistent with the prediction of an 
increasing demand for analyst services for firms with less readable communication and 
a greater collective effort by analysts for firms with less readable disclosures. Our 
results contribute to the understanding of the role of analysts as information intermedi- 
aries for investors and the effect of the complexity of written financial communication on 
the usefulness of this informatlon. 


Keywords: financial analysts; readability; Gunning Fog Index; analyst following. 


Data Avallability: 7he data used In this study are avallable from the sources indicated 
in the text. 


I. INTRODUCTION 
ver the past two decades, changes in financial and reporting regulations (e.g., changes in 
О segment disclosures, employee stock option reporting, and Sarbanes-Oxley disclosures) 
have significantly increased the amount of required disclosures by firms to external users. 
In addition, technological advancement and new developments in financial engineering have made 
it more challenging for firms to communicate information about the underlying fundamentals of 
their businesses in a clear and informative manner. The increase in the amount of required dis- 
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closure accompanied by the challenging task of communicating increasingly complex business 
transactions to investors has led to concerns about the effectiveness of management communica- 
tion and the ability of interested users to make informed decisions based on this information. As 
a result, both the SEC and the popular press have routinely criticized firms for the complexity of 
the language in their 10-K filings (Schroeder 2002). These groups have expressed concerns about 
the ability of financial statement users, especially small investors, to understand the complicated 
writing found in firms’ financial documents (SEC 1998; Schroeder 2002; Cox 2007). 

Given the increasing complexity of firm disclosures and the related concerns about their 
usability, a natural question arises as to the role of financial analysts in intermediating such 
information to investors. Specifically, do financial analysts use their expertise to examine this 
complex communication and provide useful information to financial statement users, or do they 
prefer to focus their efforts on firms with less complex communication? We attempt to answer this 
question by examining how the behavior of financial analysts relates to the readability of firms’ 
written communication. 

Prior literature examines the relation between the properties of firms’ financial disclosures and . 
the behavior of security analysts. These properties include the informativeness of disclosures 
(Lang and Lundholm 1996; Healy et al. 1999), the use of segment disclosures (Botosan and Harris 
2000), and the effect of intangible assets (Barth et al. 2001). These studies generally document that 
firms with better disclosure quality tend to attract greater analyst following. Other studies examine 
the effect of certain financial reporting items on the properties of analyst earnings forecasts. These 
studies find that analyst forecasts are influenced by firms’ accounting choices (Hopkins et al. 2000; 
Bradshaw et al. 2008), changes in the tax law (Plumlee 2003), the clarity of the income effects of 
specific financial items (Hirst and Hopkins 1998; Hirst et al. 2003), and international diversifica- 
tion (Duru and Reeb 2002). 

While each of these studies contributes to our understanding of the effect of firms’ financial 
reporting and disclosure choices on analyst behavior, they generally focus on the effect of a single 
financial statement item (e.g., interest rates, taxes) or disclosure (e.g., segment reporting).’ This 
approach, however, does not explicitly incorporate the potential effect of the properties of other 
financial statement items or disclosures on analyst following or the properties of their earnings 
forecasts. More important, this approach does not address the potential impact on analyst behavior 
of the overall complexity of firms' financial communication, that is, the cost of processing and 
interpreting the entirety of firms' disclosures. This potential limitation is particularly important, 
given the broad nature of corporate disclosures such as 10-K filings that describe a significant 
number of interrelated financial items. | 

We attempt to address this issue Бу examining the relation between a comprehensive measure 
of the overall readability of corporate 10-K filings and analyst behavior. This measure, known as 
the Gunning Fog Index (hereafter, Fog Index), incorporates the number of words per sentence and 
the number of complex words in a document to derive a measure of the readability or syntactic 
complexity of firms' 10-K filings (Jones and Shoemaker 1994; Courtis 1995; Li 2008)? The Fog 
Index has been widely used in social science research for several decades to examine the relation 


One notable exception is Lang and Lundholm (1996) who use AIMR scores as a measure of overall disclosure quality. 

The potential limitations of this measure, however, ace that it is based on a subjective survey of- analysts, limited to a 

subset of large firms, and no longer available after 1995, 
? Specifically, we focus on the readability of firms' written communication as a measure of the syntactic complexity of 
such communication. Stimulated by Chomsky's (1965) seminal generative grammar, there has been a great deal of 
theoretical and empirical work on what constitutes syntactic complexity and how it functions as a determinant of both 
sentence comprehension (Gibson 1998) and sentence production (Thompson and Faroqi-Shah 2002). According to Stone 
et al. (2005, 341-343), sentence complexity is affected by the features of open-class words (nouns and verbs) and their 
relationships and the number and typz of syntactic operations. 
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between the readability of written information and various decisions or outcomes.” Further, recent 
studies use the Fog Index to examine the readability of annual reports in connection with earnings 
persistence (Li 2008), timely price adjustment (Callen et al. 2009), and investment efficiency 
(Biddle et al. 2009). 

The Fog Index offers several important advantages. First, it allows us to study a large and 
diverse group of firms. Second, it is an objective measure, not based on analyst surveys or opinion, 
and can be calculated for any narrative disclosure. Finally, it allows us not only to capture the 
effect of the readability of a variety of financial items, but, more important, to directly examine the 
overall syntactic complexity of firms’ written public communication, over and above its specific 
content. 

We focus on the readability of the 10-K filing for several reasons. First, 10-K filings are 
required for all publicly traded companies and are frequently cited as an input in the decisions of 
investors and financial analysts (Previts et al. 1994; Rogers and Grant 1997). Second, both the 
SEC and popular press have criticized firms for the complexity of their language in these filings 
and the effect of this complexity on investors (Schroeder 2002). You and Zhang (2009) provide 
evidence that suggests that investors uüderreact to the information provided in 10-K filings and 
that this effect is stronger for firms with more complex 10-K reports. Finally, the 10-K filing 
contains a significant amount of written communication or narrative to use in interpreting its 
readability. These reasons make the 10-K filing an interesting setting to examine the effects of the 
readability of firm communication on the activities of financial analysts. 

We test the relation between the readability of 10-K filings and several measures of analyst 
behavior, including analyst following, analyst forecast revision response time, the information 
content of analysts’ reports, and the properties of analyst earnings forecasts. We begin our analysis 
by examining the relation between analyst following and 10-K filing readability. We argue that, on - 
the one hand, lower readability of firm financial disclosures increases the cost of processing the 
information in these disclosures and therefore will increase the demand for analyst services. If the 
cost of obtaining analyst reports is less than the information-processing costs of firm disclosures, 
then analyst following should be greater for firms with less readable 10-K filings. On the other 
hand, less readable disclosure could also increase the costs of analyst coverage. That is, analysts 
could bear greater information-processing costs and higher private search costs, which could lead 
to less accurate forecasts. If these costs are significant, then analyst following should be lower for 
firms with less readable disclosures. 

Consistent with the prediction of an increasing demand for analyst services for firms with less 
readable 10-K filings, we document a positive and significant association between a firm’s 10-K 
Fog Index and the number of analysts who cover the firm, even after controlling for other factors 
related to analyst coverage. We interpret this result as being consistent with a greater collective 
effort by analysts for firms with less readable disclosures. 

Next, we examine whether individual analyst effort is associated with the readability of the 
10-K filing. We measure individual analyst effort as the length of time required for analysts to 
issue their first forecast revision following the 10-K filing. If analysts bear costs in following firms 


w 


The Fog Index has been used in a variety of applications including medical error, consumer drug use, consumer 
warranties, mutual fund prospectuses, jury instructions, and academic research prestige. For specific examples, see 
Charrow and Charrow (1979), Armstrong (1980), Shuptrine and Moore (1980), Ott and Hardie (1997), Gazmararian et 
al. (1999), Koo et al. (2003), and Lee et al. (2006). 
4 In a recent speech, former SEC Chairman Christopher Cox specifically identifies readability measures as a tool to 
examine communication complexity. He states, “Just as the Black-Scholes model is commonplace when it comes to 
compliance with the stock option compensation rules, we may soon be looking to the Gunning-Fog and Flesch-Kincaid 
models to judge the level of compliance with the plain English rules” (Cox 2007). Similarly, Core (2001) proposes the 
use of computational linguistics methods in accounting to measure corporate disclosure quality. 
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with less readable disclosures, then it should take them more time, on average, to issue reports 
following less readable disclosures. Using a variety of empirical approaches, we find that analysts 
who cover firms with less readable 10-K filings take a longer time to issue their first report 
following the 10-K filing. This evidence is consistent with the argument that analysts exert more 
effort to follow firms with less readable disclosures. 

We then turn to an examination of the relation between 10-K readability and the information 
content of analyst reports. Similar to Frankel et al. (2006), we measure information content as the 
proportion of a firm’s stock returns related to analyst forecast revisions to the total firm’s stock 
return during the time period between the 10-K filing and the subsequent fiscal year-end. We 
predict that, if investors place greater reliance on analyst reports issued for firms with less readable 
10-K filings (potentially due to the higher costs associated with processing and interpreting a 
complex report), then the information content of analyst reports, or the proportion of firm returns 
associated with them, should be higher for firms with less readable disclosures. Consistent with 
this prediction, we find evidence that the proportion of firm returns associated with analysts’ 
reports is higher for firms with less readable 10-K reports. This evidence suggests that investors 
find analysts’ reports more informative for firms with less readable corporate disclosures. 

Finally, we examine how 10-K filing readability relates to commonly studied properties of 
analyst earnings forecasts: dispersion, accuracy, and two measures of earnings consensus forecast 
uncertainty (overall and common uncertainty) based on Barron et al. (1998). Prior literature finds 
that the 10-K filing represents a major source of information used by analysts (Previts et al. 1994; 
Rogers and Grant 1997). However, studies also find that the complexity of items in the 10-K filing 
affects analysts’ use of this information. For example, Plumlee (2003) finds that analyst forecasts 
are less accurate if they are associated with complex changes in the tax law. Bradshaw et al. 
(2008) find that differences in accounting choice negatively affect forecast accuracy and increase 
dispersion. Our tests relate to these findings, because we examine the effect of complexity through 
the ability of analysts to incorporate less readable financial information into their forecasts. If 
lower readability increases the costs of processing and interpreting firm disclosures, then it could 
limit the ability of analysts to correctly incorporate all of the pertinent information and could result 
in disagreement or ambiguity. We predict and find that analyst earnings forecasts for firms with 
less readable 10-K reports have greater dispersion, are less accurate, and are associated with 
greater overall analyst uncertainty. We also find that analyst common uncertainty is increasing in 
10-K Fog. These results suggest that analyst forecasts are affected by 10-K readability and provide 
indirect evidence to support the notion that the information contained in the 10-K filing is used by 
analysts. 

This study contributes to the literature in the following ways. First, our overall findings that 
lower readability is associated with greater levels of analyst coverage, effort, and information 
content, but with lower accuracy, higher dispersion, and greater uncertainty of their earnings 
forecasts complements and contributes to the literature on how analysts respond to firms’ disclo- 
sure. While prior studies generally focus on the effect of the complexity of specific attributes of 
firms’ disclosure on analyst behavior, we provide evidence that suggests that analyst behavior is 
associated with overall disclosure readability.° While the collection of the evidence in this study is 
consistent with this inference, the evidence obtained from each individual test is subject to limi- 
tations common to our research design. 


5 We note that covering firms with less readable communication is likely to result in a trade-off. Investors are more likely 
to find analyst coverage useful for firms with less readable communication; however, earnings forecasts for these firms 
will potentially be less accurate. Our findings suggest that the benefits of coverage outweigh the costs, because we 
document that both greater coverage and less accurate forecasts are associated with less readable communication 
(see also Li et al. 2009). This is consistent with the notion that investors value more than just forecast accuracy 
(Schipper 1991; Clement and Tse 2003). 


The Accounting Review May 2011 
American Accounting Association 


The Effect of Annual Report Readability 1091 


Second, our findings relate to the debate about the intended audience of public corporate 
filings. Since its organization, the SEC has made consistent efforts to encourage firms to make 
their regulatory filings accessible to the average or “lay” investor (Firtel 1999).° The most recent 
of these efforts is the plain English disclosure rules adopted by the SEC on January 22, 1998. Our 
evidence, that less readable corporate disclosures are associated with greater analyst following and 
more informative analyst reports, suggests that analysts could serve as an available information 
intermediary for investors and relates to the SEC’s concerns about the accessibility of these 
reports. 

Section П discusses the role of the readability of firm communication and the hypotheses 
development. Section Ш describes the data and sample, and section IV presents our empirical 
evidence. Section V concludes. 


IL HYPOTHESES DEVELOPMENT 

Less readable communication is more difficult to interpret and process by investors because it 
requires that investors devote more time and effort to identify and extract relevant information 
(Bloomfield 2002). In this study we focus on the written complexity of firm disclosures, as 
measured by readability rather than content. We interpret disclosure readability to be a measure of 
the costs incurred by users to process and interpret a firm’s written communication after control- 
ling for the operational complexity of the business.’ Firms with similar operations provide disclo- 
sures with varying levels of readability. For example, Berkshire Hathaway and AIG both provide 
a section about their reinsurance businesses in the management’s discussion and analysis section 
of their 10-K filings for fiscal year 2003. While the subject matter and underlying business 
complexity of the reinsurance business sections are very similar across the two firms, Berkshire 
Hathaway’s explanation is more readable as evidence by a lower Fog Index." This is not surpris- 
ing, insofar as Berkshire Hathaway CEO Warren Buffett is a strong proponent of more readable 
communication. 

We begin our analysis by examining the effect of disclosure readability on analyst following. 
We assume that users of financial information have different abilities to process complex commu- 
nication (Indjejikian 1991; Ball 1992). These differences provide opportunities for information 
intermediaries, such as financial analysts, to profit from their private analysis of firms by selling 
their opinions to users with greater information-processing costs (Schipper 1991). These profitable 
opportunities are arguably greater for firms with less readable disclosures because of the greater 
cost to users of processing the firms’ information. As such, if analysts respond to this increased 
demand for their services, then we expect analyst following to be greater for firms with less 
readable disclosures. 


Some academic scholars and practitioners have argued that the primary users of this information should be market 
professionals such as analysts and not the "lay" investor (Kripke 1970; Schipper 1991). This is because nonprofessionals 
might not possess the skill or expertise to read and understand the complex financial information contained in disclosure 
documents абі, therefore, any effort to gear disclosure toward the layperson is a waste of time and money (Kripke 
1970). Further, critics argue that the SEC's efforts to appeal to the average investor are not only inefficient, but also are 
hazardous to the disclosure regime because simplified and concise disclosure often leaves out many issues that are 
potentially valuable to the professional and leads to potential legal liability. 

It is important to note that mare readable communication can come at a cost. Legal scholars point out that “it is much 
harder to simplify than to complicate” (Kimble 1994, 53) and it requires significant skill, work, and time to compose 
documents in plain language. Additionally, some critics argue that plain English communication increases the risk of 
litigation (Kripke 1973). 

* The Fog Index for AIG's reinsurance business section of ће MD&A is 18.51, while Berkshire Hathaway’s is 17.23. The 
difference between these scores (1.28) can be interpreted as the number of additional years of formal education required 
to understand the text on a first reading. Further details regarding the Fog Index are provided in Section IIL 
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We note, however, that analysts also face a variety of costs in covering firms with less 
readable communication. First, analysts bear the direct costs of processing the information pro- 
vided by management. Second, less readable communication can result in confusion, requiring 
analysts to incur greater private search costs as they obtain additional information to evaluate and 
interpret management’s communication. Third, greater syntactic complexity could lead to inaccu- 
rate forecasts and recommendations that adversely affect analysts’ careers (e.g., Mikhail et al. 
1999; Plumlee 2003; Hong and Kubik 2003). Finally, Li (2008) finds that firms attempt to 
obfuscate bad news by increasing the complexity of their communication, and Lang et al. (2004) 
find that analysts are less likely to cover firms with incentives to withhold or manipulate infor- 
mation. These potential manipulations along with the costs to analysts of covering firms with less 
readable communication discourage analyst following. Because there are both potential positive 
and negative consequences of syntactic complexity on analyst following, we test the null hypoth- 
esis that syntactic complexity has no association with analyst following. 

Given the difficulty of following firms with less readable disclosures, analysts who choose to 
follow these firms likely exert greater effort to do so. One way of measuring the effort that 
analysts exert in following firms with less readable disclosures is to examine the average time 
from the firm’s 10-K filing to the analyst’s first report subsequent to the filing. If lower readability 
results in higher processing costs to analysts, then firms with less readable communication will 
have greater average response time from the analysts that cover them as compared to the analysts 
covering firms with more readable communication. Similarly, analysts who cover a portfolio of 
firms that have less readable communication should take longer, on average, to issue their reports 
than other analysts. We test the hypothesis that analysts exert greater individual effort to cover 
firms with less readable disclosures. 

Readability could also affect the properties of analyst earnings forecasts. Specifically, we 
examine its effect on the information content, accuracy, and dispersion of analyst forecasts and the 
uncertainty in analysts’ information environment. Analyst reports for firms with less readable 
communication likely provide information that is more useful to investors due to greater costs in 
processing public information. Analyst reports for firms with less readable communication would 
also be more informative if analysts choose to acquire and incorporate more private information 
due to the difficulties involved in interpreting less readable disclosures. Accordingly, we predict 
that analyst reports for firms with less readable communication will have greater information 
content, on average, than firms with more readable communication. 

Finally, because analysts who follow firms with less readable communication bear the costs of 
processing and interpreting such disclosures, syntactic complexity could also affect the dispersion 
and accuracy of analyst earnings forecasts as well as the degree of the overall and common 
uncertainty embedded in analyst forecasts. If less readable communication increases analysts' cost 
to process and interpret the information, then it is likely to lead to a more diverse set of interpre- 
tations about firm disclosures, resulting in higher analyst forecast dispersion. In addition, if less 
readable communication makes it more difficult to forecast earnings, then the accuracy of the 
analyst consensus forecast will be lower for firms with less readable communication. However, 
this difference in accuracy can be fully or partially offset by greater analyst effort in response to 
greater syntactic complexity. Accordingly, we predict that analyst forecasts will have greater 
dispersion and be less accurate for firms with less readable communication. 





? Li et al. (2009) find tbat once analysts become all-stars they are more willing to cover firms with greater potential 
earnings management because investors could find their coverage more valuable for these firms. This suggests that 
coverage decisions can be related to favorable career outcomes; however, it is not clear how analysts assess the 
trade-offs between the demand for investment information and the cost of potentially less accurate forecasts. 
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Barron et al. (1998) point out that analyst earnings forecast dispersion and accuracy do not 
directly capture the theoretical properties of analysts’ information environment such as uncertainty 
because these measures relate in different ways to the idiosyncratic and the common components 
of error in analysts’ forecasts. They view idiosyncratic uncertainty as uncertainty in private infor- 
mation that affects each analyst differently, and common uncertainty as uncertainty in information 
that is common to all analysts. Accordingly, lower readability can be associated with higher 
dispersion and lower accuracy in earnings forecasts for reasons related to either or both common 
and idiosyncratic uncertainty in the analysts’ information environment. 

To further examine the association between readability and properties of analyst earnings 
forecasts, we follow the model derived in Barron et al. (1998) and construct empirical measures of 
the overall uncertainty (modeled as the sum of the idiosyncratic and the common uncertainty) and 
the common uncertainty in analyst forecasts. These measures combine the accuracy, the disper- 
sion, and the number of analyst forecasts, allowing us to examine more directly how readability 
relates to analysts’ information environment uncertainty. We predict that analyst forecasts for firms 
with less readable reports will be associated with greater overall uncertainty. The effect of read- 
ability on common uncertainty (і.е., how much the average belief reflects common versus private 
information) is less clear because analysts can make trade-offs between public and private infor- 
mation sources based on the relative precision and importance of these sources. 


Ш. SAMPLE AND VARIABLE DEFINITIONS 
Sample Selection 


Our initial sample is based on the intersection of firm/years available on the Compustat 
Fundamental Annual table and the SEC’s EDGAR filings database for fiscal years 1995-2006. 
These databases are joined based on Compustat GVKEY and the SEC’s Central Index Key (CIK). 
Firms without matches are dropped from the sample. For each firm-year observation, we down- 
load the corresponding 10-K filing. Filings with less than 3,000 words or 100 lines are dropped to 
ensure that a complete filing is examined and that no errors were made in the filing transmission. 
This procedure results in 57,642 observations. We obtain stock return data from CRSP, analyst 
data from I/B/E/S, institutional holdings data from Thomson Reuters (CDA/Spectrum), and infor- 
mation on management earnings guidance from the First Call Company Issued Guidelines data- 
base. This procedure yields a sample of 33,704 observations. 


Disclosure Readability 


Similar to Li (2008), we measure the readability of 10-K filings using the Fog Index (FOG). 
This index, developed in the computational linguistics literature, captures the written complexity 
of a document as a function of the number of syllables per word and the number of words per 
sentence. Specifically, we calculate the readability of the 10-K report for firm i in year t as follows: 


ЕОС; ‚= (average words per sentence + percent of complex words) х 0.4 (1) 


where a complex word is defined as one with three or more syllables. The index is interpreted as 
the number of vears of formal education required for a person of average intelligence to read the 
document once and understand 11.10 It is important to note that the Fog Index is a measuring tool, 


9 Before computing the Fog Index, we remove all tables, tabulated text, and financial statements from the 10-K. The 
constant of 0.4 found in Equation (1) was chosen by Robert Gunning based on the scores of a set of literary benchmarks 
in order that this specific interpretation could be made. Our inferences remain unchanged using alternative measures of 
readability such as the Kincaide grade level formula, the Flesch Reading Hase Index, and 10-K length. In addition, our 
main inferences are similar after including in the analysis the length of the 10-K filing as a control variable for the 
amount of disclosure provided by the firm (Leuz and Schrand 2009). 
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not a rule or formula for good writing; as stated by Gunning (1969, 12), “Nonsense written simply 
is still nonsense.” It predicts the readability of a document, but does not provide information about 
whether the writing is interesting or informative. Despite these limitations, it is objective and 
simple to calculate. It allows us to study the disclosure characteristics of a large and diverse group 
of firms and does not depend on analyst surveys or opinions. It also provides us with a compre- 
hensive measure of the overall syntactic complexity of 10-K filings as opposed to the complexity 
of individual financial items.” 

Table 1 provides descriptive statistics on the Fog Index for firms in the intersection of Com- 
pustat and EDGAR. As can be seen in Panel A, the overall mean and median of the Fog Index are 
19.52 and 19.38, respectively. As a comparison to help validate our measure, it is interesting to 
note that the average Fog Index for FASB Statement Nos. 1-122 is 22, the CPA Exams is 16, the 
CMA Exams is 17, Wall Street Journal is 12, and Reader’s Digest is 8 (Phillips et al. 2007; Cox 
2007). The relatively high mean and median Fog Index in our sample is consistent with concerns 
that financial reports are written in complex language. 

While the variation in the mean Fog Index over the sample period is modest, there is large 
variation within each year. For example, over the total sample years, the interquartile range for the 
Fog Index is from 1.58 to 1.88 (about 9 percent of the mean). There is also significant variation in 
the Fog Index within industries, despite similarities in the underlying business complexity within 
each industry. Table 1, Panel B provides examples of specific industries that have high and low 
levels of the Fog Index (industries are classified using the Fama and French 48-industry classifi- 
cation). The healthcare, insurance, trading (e.g., security brokers, investment offices, etc.), utilities, 
and telecommunications industries comprise the group with the highest Fog Index. The fact that 
these industries have a low level of disclosure readability is not surprising because they are 
characterized by complex contracts and business models that are difficult to communicate. How- 
ever, there is also significant variation within each industry. Industries with a low Fog Index 
comprise precious metals, shipping containers, food products, agriculture, and defense industries. 
While the precious metals and shipping container industries have small within-industry variation, 
most industries have an interquartile range of the Fog Index above 1.60. The relative rankings of 
these industries also help to validate the ability of the Fog Index to measure the readability of 
annual reports. 


Variable Definitions 


Our tests focus on examining the relation between the Fog Index and analyst following, their 
forecast revision response time, the information content of their reports, and the properties of their 
earnings forecasts. A description of the construction of these variables as well as our control 
variables follows. 


Analyst Following 

Similar to prior research (O'Brien and Bhushan 1990; Brennan and Subrahmanyam 1995), we 
define analyst following as the number of analysts (ANALYSTS) that comprise the first I/B/E/S 
consensus annual earnings forecast after the filing date of the 10-K report. We follow Bhushan 
(1989) and interpret this measure as a proxy for the collective effort of the financial analyst 
community in the analysis of an individual firm." Because some firms are not covered Бу I/B/E/S, 


Ч Prior literature used the Fog Index and other similarly constructed measures to examine the readability of the overall 
annual report (Jones and Shoemaker 1994), management's discussion and analysis (Schroeder and Gibson 1990), and 
the notes to the financial statements (Smith and Smith 1971; Healy 1977). 

12 Similar to Bhushan (1989), we acknowledge that our proxy is not a perfect measure because it assumes homogeneity 
among analyst effort levels. For example, there are differences in individual analysts’ effort levels based on differences 
in compensation, brokerage houses, etc. 
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we conduct tests for both the I/B/E/S sample and the full sample of firms, where missing coverage 
is coded as zero analyst coverage (Barth et al. 2001). Because these results are similar, we report 
our main results based on the I/B/E/S sample. 


Analyst Forecast Revision Response Time 


We define the analyst forecast revision response time as the time from the 10-K filing to the 
first annual or quarterly earnings forecast issued by each individual analyst following the firm. To 
ensure that we include only analysts who actively follow the firm, we require that each analyst 
issue a forecast in the 90 days prior to the 10-K filing and then another report within 90 days of 
the 10-K filing (this process eliminates 10,458 observations related to analysts who stopped 
coverage). Because earnings announcements can prompt analysts to issue reports, we exclude 
reports made after any earnings announcement that occurs after the 10-K filing, but before the end 
of the 90-day window. We then define analyst report duration as the length of time in days between 
the 10-K filing and the first report following the filing. In addition, we average individual analyst 
report duration at both the individual firm and analyst level for each year. We interpret this 
duration variable as a measure of the required amount of effort for an individual analyst to read, 
understand, and process the information contained in the 10-K filing and to issue an updated 
earnings forecast, As mentioned above, we expect analyst response time to be longer for less 
readable 10-K filings. 


Information Content of Analyst Reports 


Similar to Frankel et al. (2006), we measure the information content of analyst reports as the 
proportion of a firm’s stock returns related to analyst forecast revisions to the total stock return 
during the time period between the 10-K filing and the subsequent fiscal year-end. This measure is 
constructed as the sum of the one-day, absolute size-adjusted returns on the analyst forecast 
revision day, divided by the sum of the one-day, absolute size-adjusted returns over the entire 
window. We exclude analyst reports that coincide with earnings announcements. We treat mul- 
tiple reports issued on the same day as a single report. Also, an observation must have a minimum 
of 90 trading days with available data to be considered in our tests. Similar to Frankel et al. 
(2006), we interpret this measure as the percentage of total firm information provided to investors 
that is related to analyst reports. 


Properties of Analyst Earnings Forecasts 

We define analyst forecast dispersion (DISPERSION) as the standard deviation of the indi- 
vidual analyst forecasts in the first analyst consensus annual earnings forecast issued after the 
10-K filing for the fiscal period following the 10-K filing, scaled by share price 90 days before the 
consensus forecast date. Analyst forecast accuracy (ACCURACY) is computed as the squared 
difference between I/B/E/S reported earnings and the analyst consensus forecast, scaled by share 
price 90 days before the consensus forecast date. ^ We define analyst overall uncertainty 


13 Frankel et al. (2006) divide this measure by the number of forecast revision dates to obtain a measure of the average 
informativeness of an analyst report date. We omit this final step for two reasons. First, our variable of interest is the 
overall firm information that comes from analysts, not the average information content. Second, firms with less readable 
disclosures are likely to obtain greater amounts of information from analysts because analysts are more likely to revise 
their forecasts more often for such firms. We confirm this supposition in untabulated results. Thus, dividing by the 
number of forecast revisions would induce a negative correlation between the analyst informativeness measure and 10-K 
readability. Our results are robust to the inclusion of the number of analysts following the firm as a control for the 
expected number of analyst revisions. 

№ Our inferences are not sensitive to scaling the measures of analyst forecast accuracy and dispersion by the absolute value 
of earnings or using the unscaled variables. 
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(UNCERTAINTY oyggarz) and common uncertainty (UNCERTAINTY common) using the following 
equations derived by Barron et al. (1998):7 





UNCERTAINTY. =(1- | ж DISPERSION + ACCURACY 2 
OVERALL 177 #ANALYSTS: хэн @) 
DISPERSION | 
ACCURACY — ANALYSTS 
UNCERTAINTY common = . (3) 
UNCERTAINTY OVERAL, 


Following Barron et al. (1998), we interpret overall uncertainty (UNCERTAINTY дурра) 88 
the sum of the idiosyncratic uncertainty (і.е., uncertainty associated with analysts’ private infor- 
mation) and common uncertainty (ie., uncertainty associated with information common to all 
analysts). UNCERTAINTY common is calculated as the proportion of common uncertainty to over- 
all uncertainty. Thus, we interpret UNCERTAINTY common to be a measure of how much the 
average analyst’s belief reflects common versus private information. As shown in Equations (2) 
and (3), as DISPERSION approaches 0, UNCERTAINTY oygrar, approaches ACCURACY and 
UNCERTAINTY соммом approaches 1. This suggests that, when there is no disagreement among 
analysts (ie. DISPERISON equals 0), the total uncertainty is only associated with analysts’ 
common information and all information impounded in analyst forecasts is public. Empirically, 
because the uncertainty measures emphasize information across analysts, we require that each firm 
have at least four analysts following it. 


Control Variables 

Our analysis controls for a variety of variables that have been shown by prior literature to be 
associated with firms’ information environment and business complexity and therefore relate to 
analyst behavior. Prior work finds that firm size is the most important determinant of analyst 
following (Bhushan 1989; O’Brien and Bhushan 1990; Brennan and Hughes 1991; Lang and 
Lundholm 1996; Barth et al. 2001). These studies find that larger firms have greater analyst 
following and suggest that large firms have better information environments, potentially more 
complex operations, and greater demand for investment advice. We use the natural logarithm of 
market value as of the year ending prior to the 10-K filing (LOGSIZE) as a proxy for size. 

Following Barth et al. (2001), we include controls for growth. High-growth firms tend to 
attract greater analyst following due to investor interest and the potential for future investment 
banking deals. Further, analysts likely find it more difficult to accurately forecast earnings for 
firms with high growth, leading to greater disagreement among analysts and less accurate fore- 
casts. We define the variable GROWTH as the compounded average growth rate in sales over the 
prior three to five fiscal years. Our inferences are robust to the inclusion of the natural logarithm 
of the book-to-market ratio as a proxy for growth. 

Similar to Bradshaw et al. (2008), we include the natural logarithm of the number of business 
segments reported in the Compustat Segment File as a control for the underlying complexity of the 
firm. We also include a control for the level of institutional holdings, following the evidence in 
Bhushan (1989), Brennan and Subrahmanyam (1995), and Frankel et al. (2006). These studies find 
that institutional ownership is positively associated with analyst following and with the informa- 


15 We thank the reviewer for suggesting that we incorporate the Barron et al, (1998) measures into our analysis, Note that 
the variables we term overall and common uncertainty are termed analyst uncertainty and cansensus, respectively, in 
Barron et al. (1998). We use the term common uncertainty as opposed to consensus to avoid confusion regarding our use 
of analyst consensus forecasts. 
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tion content of their герогв. 9 Institutional ownership can also be associated with higher analyst 
forecast accuracy and lower dispersion because firms with high levels of institutional holdings 
tend to have better information environments. We define the variable PINST as the percentage of 
a firm's shares that are held by institutions from the 13F disclosures for the most recent quarter 
prior to the 10-K filing. Motivated by the evidence in Lang and Lundholm (1996), who document 
that analyst following increases with the quality of disclosures, we include the number of man- 
agement earnings forecasts made during the prior year (MFCOUNT) as a proxy for firm discre- 
tionary disclosure (Nagar et al. 2003; Cotter et al. 2006), as well as the absolute value of the 
cumulative, market-adjusted return for the two-day window around the 10-K filing (70-K NEWS) 
as a proxy for the disclosure informativeness.'" 

Similar to Barth et al. (2001), we include variables to control for firms' information environ- 
ment and business complexity, as these variables likely affect the properties of analyst earnings 
forecasts. Barth et al. (2001) examine the association between analysts' incentives to follow firms 
and the extent of their intangible assets. Because many intangible assets are generally not recog- 
nized and estimates of their fair value are not disclosed, Barth et al. (2001) argue that analysts 
have increased incentives to follow firms with greater intangibles due to increased demand from 
investors. They find that analyst following is greater for firms with larger research and develop- 
ment and advertising expenses. They interpret these results as evidence that analysts respond to 
the demands of investors for more information because of the difficulty in evaluating firms’ 
intangible assets. Other studies examine the effect of intangibles on the properties of analyst 
forecasts. Barron et al. (2002) find that analyst uncertainty increases with the level of a firm's 
intangible assets. Gu and Wang (2005) find that analyst forecast errors are increasing in firm 
intangible intensity. Intangibles are important for our tests because they could be associated with 
10-K readability. For example, it is likely more difficult to explain the operations of firms with 
high levels of research and development costs in a less complex manner. Similar to Barth et al. 
(2001), we define R&D as the ratio of research and development expense to operating expense and 
ADV as the ratio of advertising expense to operating expense. '* 

Finally, we include the standard deviation of firm monthly stock returns from the prior year 
(STD. RET) as a measure of information uncertainty." Bhushan (1989) suggests that private in- 
formation is more valuable for firms with higher return volatility and thus positively related to the 
demand for analyst services. However, it is likely that analysts bear increased costs for following 
firms with higher return volatility. Additionally, most of our tests include industry fixed affects 
(based on the Fama and French classification) and year fixed effects to account for variation in 
analyst following across specific industries and over time. This approach is used to help control for 
variation in business complexity or information uncertainty that is driven by industry or time. 


1$ O’Brien and Bhushan (1990) suggest that analyst following and institutional ownership are potentially endogenous. 
Frankel et al. (2006) assume that the variables are exogenous. Similar to Barth et al. (2001), we examine our results with 
and without the inclusion of institutional ownership and also with the inclusion of lagged institutional ownership. The 
coefficient on FOG remains positive and significant under these specifications. 

We thank a reviewer for pointing out the importance of controlling for the informativeness of the 10-K disclosure. 

15 Our results are robust to the inclusion of the amount of recognized intangibles and depreciation expense, which are also 
examined by Barth et al. (2001). 

19 While some of our other control variables (e.g., sales growth and R&D) are likely correlated with firms’ information 
uncertainty, we include this measure as a more explicit, market-based measure of this construct. In addition, the 
inclusion of an alternative measure, earnings volatility, does not significantly affect our results. However, the inclusion 
of this measure reduces our sample by about 20 percent due to the need for a sufficient time-series of earnings. 
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ТУ. RESULTS 
Summary Statistics 


Table 2 presents summary statistics for the sample of 33;704 firm-year observations that 
remain after imposing the availability of data on CRSP and 1/В/Б/5. Similar to Table 1 and 
consistent with concerns raised by the SEC about disclosure complexity, the mean (median) FOG 
score is 19.53 (19.38). The standard deviation and interquartile range are 1.42 and 1.71, respec- 
tively, with an interquartile range from 18.58 to 20.29. The mean (median) number of analysts per 
firm-year observation is 6.14 (4.00). The mean (median) average analyst forecast revision re- . 
sponse time is 17.71 (16.00) days at the firm-year level and 18.77 (17.50) days at the analyst-year 
level. This suggests that, on average, analysts do not respond immediately to 10-K filings due to 
the significant amount of information that must be processed and interpreted. The mean (median) 
analyst report information content (A7) is 0.130 (0.092). This indicates that, on average, almost 13 
percent of the information reflected in stock returns during the period between the 10-K filing and 
the end of the fiscal period is derived from analyst reports. The mean (median) of forecast 
dispersion is 0.008 (0.003) and that of squared forecast error (ACCURACY) is 0.078 (0.002). The 
mean (median) of UNCERTAINTY оуккды, and UNCERTAINTY соммок are 0.051 (0.005) and 
0.328 (0.267), respectively. This suggests that about 33 percent of analyst uncertainty about future 
earnings following the 10-K filing is based on public information. 

Table 2 also provides statistics on the control variables. The mean (median) size of our sample 
firms is $2.1 billion ($381 million), and mean (median) compound averages growth rate of sales 
(GROWTH) is 0.20 (0.11). The mean (median) number of business segment is 1.83 (1), and mean 
(median) percent of institutional ownership is 46 (45). The mean (median) number of management 
earnings forecasts is 1.44 (0) and the mean (median) 10-К NEWS 18 0.03 (0.02). The mean 
(median) ratio of research and development expense (R&D) and advertising (ADV) to operating 
expense are 0.08 (0.00) and 0.01 (0.00), respectively.” The mean (median) standard deviation of 
returns (STD_RET) is 0.14 (0.12). 

Similar to Li (2008), we find in untabulated results that, while FOG is significantly correlated 
with many of the control variables, the extent of these correlations is relatively small for most 
variables. This is consistent with the conclusion that it is difficult to explain a significant propor- 
tion of the variation in FOG using firm characteristics. This evidence is important because in this 
study we assume that the Fog Index measures the readability of 10-K disclosures rather than their 
content. Consistent with this assertion, the correlation between FOG and the absolute value of the 
10-K event return is small and insignificant. 

As a caveat, we note that disclosure readability can be decomposed into innate and discre- 
tionary components. While we include a variety of variables to control for innate readability, 
analysts’ information costs are based on total readability, so the relative magnitude of these 
components is less relevant. Our inferences are limited by the extent to which the Fog Index 
measures readability and is not confounded with other (uncontrolled) firm characteristics. It is also 
important to note that, for a specific variable to influence the readability measure, it must either 
increase the length of the average sentence in the 10-K document or increase the percentage of 
complex words. Simply increasing the length of the 10-K will not directly affect the measure. 
However, we examine the effect of disclosure length on our main results in sensitivity tests 
because length also can be a form of complexity (You and Zhang 2009; Loughran and McDonald 
2010; Miller 2010). 


Note that the medians for the number of management earnings forecasts, research and development expense, and 
advertising expense are zero. This 18 primarily because we follow prior literature and code missing items to be zero. 
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TABLE 2 
Sample Descriptive Statistics 

9 Variable п Mean Std. Dev. 01 Median Q3 
FOG 33,704 19.53 1.42 18.58 19.38 20.29 
" FANALYSTS 33,704 6.14 5.91 2.00 4.00 8.00 
RESPONSE Firm 17,868 17.71 10.76 9.50 16.00 24.25 
RESPONSE дла 5,737 18.77 8.49 12.50 17.50 23.50 
Al 30,716 0.130 0.139 0.037 0.092 0.177 
DISPERSION 26,078 0.008 0.018 0.001 0.003 0.008 
ACCURACY 29,055 0.078 0.447 0.000 0.002 0.015 
UNCERTAINTY OVERALL 17,241 0.051 0.280 0.002 0.005 0.016 
UNCERTAINTY common | 17222 0.328 0.401 —0.038 0.267 0.702 
SIZE 33,704 2100 5600 122 381 1300 
GROWTH 33,704 0.20 0.34 0.04 0.11 0.24 
SEGMENTS 33,704 1.83 1.32 1.00 1.00 3.00 
РЇМ5Т 33,704 45.98 26.78 22.95 45.31 67.60 
MFCOUNT 33,704 1.44 2.69 0.00 0.00 2.00 
10-K NEWS 33,704 0.03 0.05 0.01 0.02 0.04 
ADV 33,704 0.01 0.03 0.00 0.00 0.01 
R&D 33,704 0.08 0.17 0.00 0.00 0.08 
STD_RET 33,704 0.14 0.10 0.07 0.12 0.18 


This table reports descriptive statistics for observations that are available on the SEC's EDGAR, Compustat, CRSP, and 
VB/E/S databases. The variables are pooled across fiscal years 1995-2006, 


Variable Definitions: 
FOG = Fog Index of the 10-K filing calculated as (average words per sentence + percent of 
complex words) X 0.4; 
#ANALYSTS = number of analysts in the first consensus annual earnings forecast following the 10-K filing; 
RESPONSE ж = average number of days that it takes a firm's analysts to issue their first report following the 
10-K filing; . 
RESPONSE jnatyst = average number of days that it takes an analyst to issue his/her first report following the 
.  10-K filing; 

AI = sum of the one-day, absolute size-adjusted returns from analyst reports between the 10-К 
filing and the next fiscal year-end divided by the sum of the one-day, absolute size-adjusted 
returns over the entire window; 

DISPERSION - standard deviation of the individual analyst forecasts in the first analyst consensus annual 
earnings forecast issued after the 10-K filing for the fiscal period following the 10-K 
filing, scaled by share price 90 days before the consensus forecast date; 

ACCURACY = squared difference between I/B/E/S reported earnings and the first analyst consensus annual 
earnings forecast issued after the 10-K filing for the fiscal period following the 10-К filing, 
scaled by share price 90 days before the consensus forecast date; 

UNCERTAINTY oygrar, = sum of common and idiosyncratic uncertainty among analysts computed following Barron et 
al. (1998, equation 15), using the measures of accuracy, dispersion, and analyst following as 
previously defined in this table; 

UNCERTAINTY соммом = ratio of common uncertainty to total uncertainty among analysts computed following Barron 
et al. (1998, equation 16) using the measures of accuracy, dispersion, and analyst following 
as previously defined in this table; 

SIZE = number of shares outstanding (Compustat item CSHO) times the share prices at the most 
recent fiscal year-end (Compustat item PRCC В); 
GROWTH = compound average growth rate of firm sales (Compustat item SALE) over the prior 3-5 
years (sales, / sales, (ну; 
SEGMENTS = number of reported business segments in the Compustat segment file for prior fiscal year; 
PINST = percentage of institutional ownership from the quarter prior to the 10-K filing; 


(continued on next page) 
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TABLE 2 (continued) 


MFCOUNT = number of management earnings forecasts issued in the prior year, 
10-K NEWS = absolute value of the cumulative market-adjusted return for the 10-Е filing event window 
[0,1]; 
ADV = advertising expense (Compustat item ХАР) as a percentage of operating expense (Compustat 
item XOPR) from the prior fiscal year; 
R&D = research and development expense (Compustat item XRD) as a percentage of operating 
expense (Compustat item XOPR) from the prior fiscal year; and 
STD, RET = standard deviation of the firm's monthly stock returns from the previous fiscal year. 


Analyst Following and the Readability of 10-K Filings 

Our first prediction is that analyst following is affected by the level of readability of firms’ 
10-K filings, as measured by the Fog Index. To control for other factors that can affect analyst 
following, we estimate the following regression: 


#ANALYSTS,, = Bo + B\FOG,, + ByLOGSIZE, , + В,СКОУТН,, + B4LOGSEGMENTS;, 
+ BsPINST,, + BsMFCOUNT,, + Ву10-К NEWS,,+ В,АРУ, + ByR&D,, 
+ BioSTD_RET,,, + ei. (4) 


The estimation is performed using ordinary least-squares regression with industry and time fixed 
effects. 

Column 1 of Table 3 (model 1) reports the results of the linear model regression. t-statistics, 
presented in brackets, are based on standards errors that are robust to heteroscedasticity and are 
clustered at the firm level. The coefficient on FOG is positive and statistically significant, sug- 
gesting that analyst following is greater for firms with less readable disclosures. This is consistent 
with the notion that analysts respond to investors’ demand for investment information for firms 
whose disclosures are more costly to process. The coefficients on the contro! variables are con- 
sistent with prior research. Larger firms are associated with greater analyst following as well as 
firms with higher growth rates and higher institutional ownership. Consistent with Lang and 
Lundholm (1996), we find that disclosure practice (MFCOUNT) and disclosure informativeness 
(10-K NEWS), are positively related to analyst following. We find that analyst following is nega- 
tively associated with the number of business segments." Similar to Barth et al. (2001), we 
document that analyst following is greater for firms with higher amounts of advertising and 
research and development expenses. Consistent with Bhushan (1989), we also find that analyst 
following is positively associated with firm stock return volatility, suggesting that analysts provide 
greater support to investors when private information is valuable. 

In addition to examining the linear association between FOG and analyst following, we also 
examine the association of analyst following and FOG using nonlinear and semi-parametric speci- 
fications (second and third columns of Table 3, respectively). In the nonlinear specification (model 
2), we add a square term (FOG?) to Equation (2) to capture the possibility that the association 
between readability and analyst following differs based on the level of FOG. In model 3, we 
examine a semi-parametric specification by including indicator variables for the various quartiles 
of FOG. The coefficient on РОС? (model 2) is negative and significant, indicating that the role of 


?! This effect becomes negative after controlling for firm size, but is positive if size is omitted. The negative coefficient in 
the multivariate results is consistent with the findings of Bhushan (1989) and Johnston et al. (2009). 
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TABLE 3 
The Association between Annual Report Readability and Analyst Following 
Panel A: OLS Regression 











Variable Model 1 Model 2 Model 3 
Intercept —13.067*** —20.190*** —11.625*+* 
[-16.79] [-6.29] [--18.23] 
FOG 0.085* ** 0,798 ** 
[3.74] [2.58] 
FOG? —0.018** 
[72.33] 
FOG Q2 0.168** 
[2.41] 
FOG Q3 0.331*** 
[4.11] 
FOG 04 | 0.369*++ 
[4.18] 
LOGSIZE 2.623*** 2.624*** 2.623*** 
[62.20] [62.23] [62.22] 
GROWTH 0.361 *** 0,356" 55 0,356t** 
[3.90] [3.85] [3.85] 
LOGSEGMENTS —0.367*** —0.368*** —0.368*** 
[—4.68] [—4.70] [-4.70] 
PINST 0.011 *** 0.011*** 0.011*** 
[5.20] [5.20] [5.22] 
MFCOUNT 0.086* #* 0.086* ** 0.086*** 
[5.82] [5.82] [5.83] 
10-К NEWS 2.391*** 2.380*** 2.371 *** 
[5.46] [5.43] [5.40] 
ADV 7,555 Ж ЖЖ 7.564*** 7.553*** 
[4.57] [4.57] [4.57] 
R&D 1.696*** 1.673*** 1.662*** 
[5.61] [5.54] [5.50] 
STD_RET 0.932*** 0.924*** 0.931 *** 
[2.61] [2.59] [2.61] 
Time Fixed Effects Yes Yes Yes 
Industry Fixed Effects Yes Yes Yes 
п 33704 33704 33704 
Adjusted R? 0.63 0.63 0.63 
Panel B: Comparative Interquartile Effects 
Model 1 Model 2 Model 3 
FOG 0.146 0.188 0.201 
LOGSIZE 6.236 6.238 6.238 
GROWTH 0.071 0.070 0.070 
LOGSEGMENTS —0.403 —0.404 —0.405 
PINST 0.473 0.473 0.478 
MFCOUNT 0.172 0.172 0.172 
10-K NEWS 0.081 0.080 0.080 
ADV 0.042 0.042 0.042 


(continued on next page) 
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Panel B: Comparative Interquartile Effects 











Model 1 Model 2 Model 3 
RD 0.130 0.129 0.128 
STD_RET 0.098 0.097 0.174 


ЖЖ ЖЖЖ р < 0.05 and р < 0.01, respectively, two-tailed t-tests. 


Panel A reports coefficient estimates and t-statistics (in brackets) from the regression of analyst following on FOG and 
control variables. Analyst following is defined as the number of analysts contained in the first VB/E/S consensus annual 
earnings forecast following the 10-K filing. FOG QX is an indicator variable set to 1 if the firm’s 10-K Fog Index is in 
quartile X of the sample, and 0 otherwise. All other variables are as defined in Table 2. Industry fixed effects are based on 
the Fama and French 48-industry classification. t-statistics are robust to heteroscedasticity and clustered at the firm level. 


Panel B reports the estimated effect of an interquartile change in the variable of interest. 


readability increases at a declining rate. Also, the coefficients on the second, third, and fourth FOG 
quartiles (model 3) are positive and significant and monotonically decreasing across the quartiles. 
These results are consistent with analysts making a trade-off between the benefits of covering 
firms with less readable disclosures and the costs of following them. While these costs include the 
processing costs of covering firms with less readable disclosures, analysts likely find it more 
difficult to accurately forecast the earnings of firms with more complex disclosures. These costs 
can be important, as prior research has found that forecast accuracy has important career effects 
for analysts (Mikhail et al. 1999; Hong and Kubik 2003). The effects of FOG on the properties of 
analyst earnings forecasts will be examined later.” 

Table 3, Panel B compares the effects of an interquartile change in the independent variables 
on analyst following based on the marginal effect estimates in Panel A. Consistent with prior 
literature, size (LOGSIZE) has the largest association with analyst following. An interquartile 
change in the natural logarithm of size is associated with an increase of 6.2 analysts, while that of 
all other variables is less than 1. The interquartile effect of FOG is 0.15 in model 1, 0.19 in model 
2, and 0.20 in model 3. While the incremental effect of the complexity of 10-K filings on analyst 
following is moderate, it is greater than that of most of the other variables included in the 
regressions. These are the same variables that have been shown in the literature to be related to 
analyst following. Based on these comparisons, we conclude that disclosure readability is an 
important determinant of analyst following. 

Similar to Lang and Lundholm (1996), we also examine the association between changes in 
FOG and lead changes in analyst following. If analysts do, in fact, respond to changes in FOG by 
increasing their coverage, then we expect to observe a positive relation between current changes in 
FOG and future changes in analysts following. We investigate this relation by estimating multi- 
variate models using the changes in the independent variables from Table 3. We define the current 
change in the Fog Index and cther independent variables as the difference between their current 
values and those of the prior fiscal year. We define the lead change in analyst following as the 
difference between the number of analysts following the firm after the next 10-K filing and the 
number of analysts following the firm after the current report. Table 4 presents results from these 
tests. We find that contemporaneous changes in FOG are positively related to lead changes in 
analyst following. We also find that changes in firm size and institutional ownership are positively 


22 п a related study, You and Zhang (2009) examine stock returns following 10-K filings and find that investors underreact 
to firms with more complex annual filings. This apparent underreaction provides additional motivation for analyst 
coverage because prior work has found that analysts help improve market efficiency (Barron et al. 2002; Brown and 
Sivakumar 2003; Gu and Chen 2004). . 
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TABLE 4 
The Association between Changes in Annual Report Readability and Lead Changes in 
Analyst Following à 
Variable Model 1 Model 2 Model 3 
Intercept —0.038*** —0.051*** —0.073*+* 
[-3.79] [~4.68] [-3.03] 
АРОС 0.024*** 0.023** 
[2.66] [2.55] 
(АЕОС)? 0.007** 
[2.58] 
АЕОС 02 —0.009 
[—0.28] 
АЕОС ОЗ 0.045 
[1.36] 
АЕОС О4 0.110*** 
[3.20] 
ALOGSIZE 0.480*** 0.479*** 0.480*** 
[21.89] [21.83] [21.91] 
AGROWTH 0.048 0.045 0.050 
[0.66] [0.62] [0.68] 
ALOGSEGMENTS —0.103** —0.103** —0.102** 
[-2.12] [- 211] [—2.11] 
APINST 0.008*** 0.008*** 0.008*** 
[5.89] [5.82] [5.84] 
AMFCOUNT —0.015 -0.015 —0.014 
[-1.61] 1-1.61| 1-1.591 
A10-K NEWS —0.138 —0.138 -0.136 
[-0.91] [-0.91] [0.90] 
AADV 0.333 0.327 0.323 
[0.50] [0.49] [0.48] 
AR&D -0.075 -0.075 —0.074 
[-0.57] [-0.57] [-0.56] 
ASTD RET —0.637*** —0.631*** —0.631*** 
[-5.18] [-5.14] [--5.14] 
п 21226 27226 27226 
Adjusted R? 0.03 0.03 0.03 





ЖЖ *** р < 0.05 and р < 0.01, respectively, two-tailed t-tests. 

This table reports coefficient estimates and t-statistics {in brackets) from the regression of lead changes in analyst following 
on changes in FOG and control variables. Lead change in analyst following is the difference between the number of 
analysts following the firm after the next 10-K filing and the number of analysts following the firm after the current report. 
AFOG ОХ is an indicator variable set to 1 if the current change in a firm's 10-K Fog Index is in quartile X of the sample, 
and 0 otherwise. All other variables are current period changes of the variables defined in Table 2. t-statistics are robust to 
heteroscedasticity and clustered at the firm level. 
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related to lead changes in analyst following and that changes in management earnings guidance, 
number of business segments, and return volatility are negatively related.” In untabulated results, 
we find that FOG has the third-highest effect of the variables employed in our analysis. The results 
from this test are important because they help to alleviate concerns about endogeneity or prior 
period information shocks relating to the levels results in Table 3. 


Analyst Report Duration 


One potential measure of the costs or effort that analysts bear in following firms with less 
readable disclosures is the amount of time it takes them to issue reports following the 10-K filing. 
We label this length of time “analyst report duration." In this section, we examine the analyst 
report duration at both the analyst and firm level. We expect that analyst report duration will be 
longer for firms with less readable 10-K filings. As noted previously, to ensure that we capture 
analyst reports issued in response to the filing of the 10-K (and not in response to other corporate 
events), we include in this analysis only firm-analyst observations for which the analyst has issued 
at least one report during the 90 days prior to the 10-K filing and issues a report in the 90 days 
subsequent to the 10-K filing.” Table 5, Panel A presents univariate results based on individual 
analyst reports and the Fog Index of the firms they cover. We classify each firm-specific FOG as 
high (low) if it is greater (less) than the median FOG value for all firms in the sample. We find that 
analysts covering firms with a high FOG require 1.45 days longer, on average, to issue their 
reports. Panels B and C of Table 5 present the average analyst report duration per firm-year and 
analyst-year, respectively. The value of FOG at the firm level is the individual firm 10-K Fog for 
each specific year. The FOG score at the analyst level is the average 10-K Fog score of the firms 
that the analyst covered in each year.” The univariate results suggest that the average (median) 
analyst report duration for firms with FOG scores higher than the median is 1.11 (1.50) days 
(Panel B). At the analyst level, we find that analysts who cover a portfolio of firms with an average 
FOG score higher than the median analyst portfolio average take 2.29 (2.17) days longer on 
average (median) to issue their first reports. These differences are statistically significant at a 1 
percent level and are likely to be economically important given the rapid pace at which markets 
impound new information. 

To further examine the analyst report duration, we estimate ordinary least-squares regressions 
on the average analyst report durations at both the firm and analyst level. Table 5, Panel D presents 
the results of these analyses. Similar to the univariate results, we find that firms with higher Fog 
scores and analysts who cover stocks with higher Fog scores are associated with longer analyst 
report duration. Specifically, controlling for a variety of other factors that can affect firms’ infor- 
mation environment, the coefficient on FOG is 0.345 for the firm-level analysis and 1.138 for the 
analyst-level analysis, both of which are statistically significant. This suggests that firms and 
analyst portfolios with one unit higher of FOG have analyst response times that are 0.345 and 
1.138 days longer. Although it is difficult to assess the economic significance of these results, we 
note that given the pace with which new analyst information is incorporated in stock prices, even 
such short delays in producing a report could reduce the information content of the report. The 
results in Table 5 further indicate that firms with greater institutional ownership, discretionary 


® While less intuitive, one potential concern is that analyst following might lead FOG. For example, management might 
provide more complex information in response to a greatar analyst following. In untabulated tests we find no relation 
between changes in FOG and Јар analyst following. 

24 As noted previously, we exclude analyst observations made after earnings announcements that follow the 10-K filing 
and reports made after 90 days after the 10-K. Our results are similar if we use reports within a year after the 10-K 
filing; however, these observations are likely unrelated to the FOG score of the 10-K filing. 

25 To be included in our analysis each analyst must cover a minimum of four firms. Similar results are obtained by 


requiring only a single firm. 
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TABLE 5 
The Association between Annual Report Readability and Analyst Report Duration 


Panel A: Individual Analyst Reports 


Group n Mean Median 
High Fog Index 30,892 19.19 16.00 
Low Fog Index 30,884 17.74 15.00 
Difference 145*** 1.00*** 


Panel B: Average Analyst Report Duration Per Firm 


Group n Mean Median 
High Fog Index 8,701 18.33 16.75 
Low Fog Index 8,701 17.22 15.25 
Difference 1.11 *** 1.50*** 


Panel C: Average Analyst Report Duration Per Analyst 


Group n Mean Median 
High Fog Index 2,869 19.91 18.67 
Low Fog Index 2,868 17.62 16.50 
Difference 2.29*** 2.17*** 


Panel D: Regression Analysis 


Mean Duration Mean Duration 
at Firm Level at Analyst Level 

Intercept 9,430" -8.750"" 
[7.62] [-2.62] 

FOG 0.34511 1.138 1 
[5.79] [6.96] 
LOGSIZE 0.034 " —0.013 
[0.51] [-0.11] 

GROWTH —0.453 2.6341 
[-1.62] [3.93] 

LOGSEGMENTS 0.405 0.8441 
[2.57] [2.67] 

PINST 0.03911 0.10511 
[9.22] [11.50] 
MFCOUNT 0.170!" 0.019 
[5.68] [0.33] 

10-K NEWS —25.4091 -43.743""! 
[—12.33] [—8.01] 

ADV ; 10.053 30.9361 
[2.52] [4.06] 
R&D 1.92511 0.544 
[3.06] [0.56] 

STD_RET —6.300^* —13.345"t 
[5:53] [5.86] 
п 17868 5737 

Adjusted R? 0.04 0.11 


(continued on next page) 
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*** р < 0.01, two-tailed t-tests for differences of means and Wilcoxon for medians. 

t p < 0.01, two-tailed t-tests. 

Analyst report duration is defined as the length of time between the 10-K filing and ће first report for each analyst. Panels 
A, B, and C report descriptive statistics of the differences in the average analyst report duration at the individual analyst 
report, firm-year, and analyst-year levels. The firm and analyst levels are computed by averaging variables over the unit of 
observation. The High and Low groups are based on the median of the sample in each group. Statistical differences of 
means (medians) are computed based cn t-tests (Wilcoxon tests). 

Panel D reports coefficient estimates based on ordinary least square regressions at both the firm and analyst levels. All 
variables are defined as in Table 2. Оп che analyst level, the variables are averaged dver the portfolio of each analyst-year 
observation, t-statistics (in brackets) are robust to heteroscedasticity and clustered at the firm and analyst levels, 
respectively, 





disclosure, and firm intangibles are associated with longer average analyst response times. In 
contrast, firms with greater reaction to the release of the 10-K report and greater prior return 
volatility are associated with shorter analyst report response time.” Overall, the evidence is 
consistent with the notion that analysts exert more effort to cover firms with less readable disclo- 
sures. 


Information Content of Analyst Reports 

Our third hypothesis predicts that the informativeness of analyst reports is positively related to 
the readability of firms' 10-K reports. To test this hypothesis, we estimate the following ordinary 
least-squares regression with industry and time fixed effects: 


Al), Po + ByFOG;, + B3LOGSIZE, , + B;GROWTH,, + ByLOCSEGMENTS, ,+ BPINST;, 
 BSMFCOUNT, , + 8;10-К NEWS, АРУ, + BR&D;, + ByoSTD, ВЕТ,, &,. (5) 


Similar to the regression reported in Table 3, we also examine the association of the information 
content of analyst reports and FOG using nonlinear and semi-paremetric specifications. 

Table 6 reports the regression findings. t-statistics, presented in brackets, are based on stan- 
dard errors that are robust to heteroscedasticity and clustered at the firm level. Consistent with our 
hypothesis, the coefficient on FOG is positive and significant, suggesting that the informativeness 
of analyst reports is increasing in the complexity of the 10-K disclosure. This evidence is consis- 
tent with the notion that investors find analyst reports for firms witt. less readable disclosures more 
useful because of greater processing costs. Analysts' private information searches could also be 
more valuable in such cases. In model 2, the square term (FOG*) is negative but insignificant, 
whereas the coefficients on the individual FOG quartiles in model 3 are significant and monotoni- 
cally increasing. These findings are consistent with our prior results and support the notion that 
analysts trade off costs and benefits in their coverage of firms with less readable disclosure. The 
effects of the control variables on the information content of analyst reports are similar in direction 
to their effects on analyst following. Specifically, the informativeness of analyst reports is increas- 
ing in firm size, growth, institutiona] ownership, discretionary disclosure, firm intangibles, and 
return volatility. In untabulated results, we find that an interquartile change in FOG increases the 





26 In untabulated results, we include a control for the number of reports issued by an analyst for the analyst level results. 
The coefficient on this variable is not statistically significant and does not quantitatively affect the results as presented. 
We also estimate two proportional hazard models to examine the influence of readability on the time to an analyst's first 
report by conditioning at the firm and analyst level, similar to O'Brien et al. (2005). We find that, on average, the 
probability of an analyst making a report at any given time is between 1.7 percent and 3.2 percent lower for each unit 
change of FOG. 
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TABLE 6 
The Association between Annual Report Readability and the Information Content of 
Analysts’ Reports 
Variable Model 1 Model 2 Model 3 
Intercept —0.256*** —0.302*** —0.227*** 
[—16.90] [—4.09] [-20.43] 
FOG 0.002*** 0.006 
[3.25] [0.86] 
ЕОС? -0.000 
[-0.62] 
FOG Q2 0.003** 
[1.99] 
FOG Q3 0.006% * * 
[3.33] 
FOG Q4 0.007*** 
[3.51] 
LOGSIZE 0.049% ** 0.049*** 0.049* ** 
[65.99] [66.03] [66.02] 
GROWTH 0.012*** 0.011 *** 0.011*** 
[5.54] [5.52] [5.50] 
LOGSEGMENTS —0.011*** —0.011*** —0.011*** 
[-6.47] [- 6.47] ^ . [7648] 
PINST 0.000*** 0.000*** 0.000*** 
[10.09] [10.09] [10.12] 
MFCOUNT 0.003*** 0.003*** 0.003*** 
[8.70] [8.70] [8.72] 
10-K NEWS 0.049 ** 0.049*** 0.049*** 
[4.34] [4.33] [4.30] 
ADV 0.089*** — - 0.090* ** 0.089* ** 
[2.65] [2.65] [2.64] 
R&D 0.006 0.005 0.005 
| [0.92] [0.89] [0.83] 
STD_RET 0.036*** 0.036*** 0.036*** 
[4.41] [4.41] [4.42] 
Time Fixed Effects Yes Yes Yes 
Industry Fixed Effects Yes Yes Yes 
n 30716 30716 30716 
Adjusted R? 0.51 0.51 0.51 


жж жжж р < 0.05, and р < 0.01, respectively, two-tailed t-tests. 

This table reports coefficient estimates and t-statistics (in brackets) from the regression of the information content of 
analyst reports on FOG and control variables. Analysts information content is defined as the proportion of firm stock 
returns related to analyst forecast revisions to the total firm stock return during the time period between the 10-K filing and 
the subsequent fiscal year-end. FOG ОХ is an indicator variable set to 1 if the firm's 10-К Fog Index is in quartile X of the 
sample, and 0 otherwise. All other variables are as defined in Table 2. Industry fixed effects are based on the Fama and 
French 48-industry classification. t-statistics are robust to heteroscedasticity and clustered at the firm level. 


information content of analyst reports by about 0.29 percent to 0.46 percent, on average, across the 
three models. Consistent with our finding for analyst following, we find that the effect of an 
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interquartile change in disclosure readability (FOG) is similar to many of the other variables in our 
analysis. 


Properties of Analyst Earnings Forecasts 

Given our findings regarding the association of disclosure readability with analyst following, 
analyst report duration, and the information content of analyst reports, a reasonable next step is to 
examine its association with properties of analyst forecasts. In particular, we focus on the disper- 
sion, accuracy, and the overall and common uncertainty associated with analyst forecasts of annual 
earnings. As previously explained, we hypothesize that less readable disclosures will be associated 
with greater forecast dispersion and lower analyst forecast accuracy. We also predict that overall 
analyst uncertainty will be higher for firms with less readable disclosures, but make no directional 
prediction regarding analyst common uncertainty. 

For each of these hypotheses, we estimate a model of the following form: 


Forecast Property; = By + В. FOG, + B,LOGSIZE,, + B;GROWTH,, + BsLOGSEGMENTS, , 
+ BsPINST,, + B9MFCOUNT, , + В,10-К NEWS,,+ ВзАРУ,, 
+ BoR&D;, + BioSTD, RET, ,* Ein, (6) 


where “Forecast Property" represents the previously defined variable of interest (DISPERSION, 
ACCURACY, UNCERTAINTY oygrar or UNCERTAINTY common). Each model is estimated using 
ordinary least-squares regression with industry and time fixed effects. Table 7 presents the results. 
t-statistics, presented in brackets, are based on standards errors that are robust to heteroscedasticity 
and clustered at the firm level. 

As reported in the first two columns of Table 7, we find that the coefficient on FOG is positive 
and significant, indicating that less readable disclosures are associated with more dispersed and 
less accurate analyst earnings forecasts. Consistent with prior research, these results, in conjunc- 
tion with our findings about the information content of analysts' reports, suggest that analysts 
provide information that investors find valuable at the cost of less accurate forecasts (Schipper 
1991; Clement and Tse 2003). It is, however, difficult to assess the significance of these benefits 
because we cannot observe the appropriate benchmark—i.e., the effect on investors of analysts 
choosing not to follow firms with less readable disclosures.” 

We further examine these results by testing the effect of 10-K readability on measures of 
overall and common analyst forecast uncertainty, as derived by Barron et al. (1998). We report our 
results in columns three and four of Table 7. Consistent with our prediction, we find that overall 
analyst uncertainty is increasing in FOG, suggesting that there is higher overall uncertainty in the 
analyst information environment for firms with less readable 10-K filings. Further, we find that the 
proportion of common analyst uncertainty to overall uncertainty is also increasing in FOG, sug- 
gesting that as readability decreases, publicly available information, such as the 10-K, becomes 


27 As an additional sensitivity test, we follow the suggestion of a reviewer and examine the éffect of analysts’ conflicts of 
interest associated with the Global Research Analyst Settlement to support our inferences. Because analysts with 
conflicts of interest are likely to place less importance on firm disclosures, we predict that our results for analyst 
following and report information content should be weaker when analysts’ conflicts of interest are higher. We note, 
however, that other important events such as Regulation Fair Disclosure occurred around this time and could limit our 
inferences from this test. In untabulated results, we find that, consistent with our prediction, the association between 
analyst following and analyst report information content is stronger in the years following the Global Research Analyst 
Settlement, suggesting greater analyst reliance and attention to 10-K filings in the post period. This test further supports 
our overall conclusion regarding the association between analyst behavior and 10-K readability. 

28 Related to this point, Lee (2010) finds that less readable 10-Q filings are associated with lower information efficiency 
and greater information asymmetry. She also finds that these negative consequences of lower readability are mitigated 
by analyst coverage, which provides evidence of the benefits of analysts following firms with less readable reports. 
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TABLE 7 


The Association between Annual Report Readability and Properties of Analysts? 
Information Environment 


Variable DISPERSION ACCURACY UNCERTAINTY укваї;, UNCERTAINTY oMMON 
Intercept 0.0192*** — 0.2887 —0.0142 —0.0440 
[4.04] [1.35] Г-0.37| [-0.56] 
FOG 0.0003*** 0.0041%% 0.0044" * * 0.0053** 
[3.96] [2.10] [2.58] [2.23] 
LOGSIZE —0.0027*** —0,0315*** —0.0133*** —0.0012 
[—22.00] [--9,89] [-7.19] [-0.43] 
GROWTH 0.0008 0.0077 —0.0005 0.0638*** 
[1.08] [0.45] [-0.04] [5.69] 
LOGSEGMENTS 0.0008*** 0,013455 0.0025 0.0081 
[3.74] [1.96] [0.54] [1.24] 
PINST —0.0001*** 00001 —0.0001 0.0010 * 
[-8.45] [0.49] [-0.77] [5.48] 
MFCOUNT —0.0001*** 00010 -0.0001 0.0034*** 
[-2.92] [1.10] [-0.11] [2.79] 
10-K NEWS 00511555. 0).7263*%% 0.5591 **** 0.3838 * ** 
[7.48] [5.80] [4.75] [4.37] 
ADV 0.0266*** — 0,3901 ** 0.0933 0.0255 
[4.13] [2.13] [1.03] [0.18] 
R&D 0.0081*** 0.0266 — 0.0036 —0.1394*** 
[5.97] [-0.87] [-0.14] [-4.39] 
STD_RET 0.0242*** — 0,3793*** 0.2200*** 0.3683*** 
[9.27] [6.03] [4.36] [6.72] 
Time Fixed Effects Yes Yes Yes Yes 
Industry Fixed Effects Yes Yes Yes Yes 
п 26078 29055 17241 17222 
Adjusted В? 0.20 0.05 0.04 0.06 


Уж ЖЖ y < 0.05, and р < 0.01, respectively, two-tailed t-tests. 

This table reports coefficient estimates and t-statistics (in brackets) from regressions of analyst earnings forecast dispersion, 
accuracy, and uncertainty on FOG and control variables. Analyst earnings forecast dispersion is defined as the standard 
deviation of the individual analyst earnings forecasts in the most recent consensus earnings forecast following the annual 
report scaled by price. Analyst earnings forecast accuracy is defined as the squared value of the difference between the 
1/В/Б/5 actual reported earnings and the most recent analyst consensus earnings forecast following the annual report filing 
sealed by price, Analyst overall and common uncertainty are calculated following the equations derived by Barron et al. 
(1998). All other variables are as defined in Table 2. Industry fixed effects are based on the Fama and French 48-industry 
classification. t-statistics are robust to heteroscedasticity and clustered at the firm Ісусі. 


more important to analysts relative to private information. This suggests that analysts have diffi- 
culty producing private information for firms with less readable reports. Overall, the evidence 
from the four regressions in Table 7 suggests that analyst earnings forecasts are affected by 10-K 
readability, supporting the notion that analysts use the information contained in the 10-K filings. 

The resuits for the control variables are largely consistent with the notion that analysts per- 
form better for firms with better information environments and less complex operations. We find 
that firms with better information environments, as measured by size and institutional ownership, 
are associated with greater accuracy, lower dispersion, and lower uncertainty. Institutional own- 
ership is also positively associated with analyst common uncertainty, consistent with these inves- 
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tors improving the importance of public information. We find that firms with greater uncertainty 
and complexity, as measured by the volatility of returns, the number of business segments, and the 
extent of intangible investment, are generally associated with lower forecast accuracy, greater 
dispersion, and greater overall uncertainty. However, their relation to analyst common uncertainty 
is less clear. While R&D investment is negatively related to common uncertainty (because less 
public information on these investments is available), return volatility is positively related. These 
findings suggest that private information is less important relative to common information for 
firms with high return volatility. While the measures of firm disclosure, MFCOUNT and 10-K 
NEWS, are both positively associated with analyst common uncertainty (.е., public disclosures 
increase the importance of common information), they differ with respect to analyst forecast 
dispersion. MFCOUNT is associated with lower forecast dispersion, consistent with managers 
providing guidance to reduce analyst disagreement, while 10-К NEWS is associated with greater 
dispersion and lower accuracy. 


У, SUMMARY AND CONCLUSION 

From the passage of the Securities Acts in 1933 and 1934 to the present, regulators, legal 
scholars, and various other parties have weighed in on the debate about the complexity of financial 
communication to external users. The SEC has gone so far as to require that the prospectuses of 
all registered public offerings meet the requirements of the Plain English Rules, and has suggested 
that similar procedures should be applied to other mandatory filings. Former SEC Chairman 
Christopher Cox has even suggested the use of readability models such as the Fog Index to 
measure the complexity of financial communication (Cox 2007). However, other users are con- 
cerned about the effects that these types of actions could have on the disclosure regime. These 
parties argue that, at a minimum, these actions are a waste of time and effort, and could lead to a 
reduction in disclosure as firms are forced to simplify their public communication (e.g., Kripke 
1970, 1973; Firtel 1999). This study sheds insight relevant to this debate by examining the effect 
of readability on one important financial information intermediary—sell-side financial analysts. 

We find evidence consistent with the notion that, because less readable firm communication is 
more costly to process and interpret, investors demand greater amounts of analyst services for 
firms with less readable communication. We find that analyst following is greater for firms with 
higher levels of syntactic complexity as measured by the Fog Index. We also find that analysts 
who cover firms with less readable communication take longer time on average to issue reports in 
response to 10-K filings. We interpret this evidence as analysts exerting greater effort to cover 
these firms. In addition, we find that analyst reports of firms with less readable 10-K reports are 
more informative to investors, but that the earnings forecasts of such firms have greater analyst 
dispersion or disagreement, are less accurate, and are associated with greater levels of uncertainty. 

Our results suggest that analyst behavior is related to the readability of firms’ communication. 
While prior studies have found that analysts are affected by the complexity of individual financial 
items, we provide evidence that the overall linguistic complexity of firms’ communication incre- 
mentally influences analyst behavior over and above the effects of the content of the document 
(e.g., taxes, interest rates). Finally, our results that analysts provide greater amounts of information 
to investors for firms with less readable communication and that investors consider this informa- 
tion informative are relevant to the SEC's debate about the intended audience of financial infor- 
mation and the SEC's concerns on the accessibility of these reports. While the SEC moves to 
reduce the complexity of firms' communication, further research is needed to examine the explicit 
costs and benefits of such actions. Ав a final caveat, while the collection of the evidence from our 
various analyses of analysts’ behavior conducted in this study is consistent with the prediction that 
10-K readability influences analyst behavior, our individual findings still are subject to the limi- 
tation that the documented relations reflect associations and may not be fully causal. 
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JOAN LUFT and MICHAEL. D. SHIBLDS, Psychology Models of Management Accounting, 
Foundations and Trends? in Accounting (Hanover, MA: now Publications, Inc., 2009, 
ISBN 978-1-60198-346-6, Vol. 4, No. 3-4, pp. 151). 


This monograph is the newest in the series entitled "Foundations and Trends? in Accounting." The objective 
of the series is to provide frameworks for understanding and organizing the vast array of existing literature in 
various fields of accounting research. This issue is co-authored by Joan Luft and Mike Shields, two thought leaders 
in the field of psychology-based research in management accounting. The main objective of the monograph is to 
devise a framework that links what we know about the subjective decisions made by management accountants 
with the theories in psychology that have proven useful to our understanding of how these decisions are made. 
These authors have previously developed a comprehensive study of the causes and consequences of management 
accounting based on an analysis of 275 papers across six leading journals (see Luft and Shields 2003). The current 
monograph provides a framework for understanding a much more specific and more focused piece of the man- 
agement accounting literature, that is, the psychological processes underlying the subjective decisions required of 
management accountants in practice. 

Given the vast array of possible pieces of the literature to review, the reader might ask, "Why is this book 
needed now?” Interest in the field of behavioral decision research in accounting has spiked due at least in part to 
the awarding of the Nobel Prize in Economic Sciences in 2002 to Daniel Kahneman and Amos Tversky for their 
work on the psychology of judgment and decision-making and its application to the field of behavioral economics. 
Although much psychology-based research in management accounting took place before 2002, the recognition of 
behavioral economics as an important economic discipline seems to have moved behavioral decision theory in 
management accounting to the forefront. The literature reviewed in this monograph reflects the best of what has 
been published in this area since then. 

This monograph should prove useful to management accounting scholars and doctoral students interested in 
behavioral decision theory and its applications in management accounting. Doctoral students would benefit from 
the authors’ framework for thinking about subjective decision making in management accounting. In addition, the 
organized framework allows scholars to more easily identify gaps in the literature that they may wish to fill. 

Do not be confused, as this not a one-to-one mapping of decision-making categories to psychological 
theories; in fact, some psychological theories are discussed as underpinnings of several of the decision categories 
identified by the authors. Such confusion may be driven in part by the title—Psychology Models of Management 
Accounting. The authors do discuss models, but I would argue they are management accounting models of 
subjective decision making. I might re-title the monograph "How Psychological Theory Informs our Understand- 
ing of Subjective Decision Making in Management Accounting." 

The monograph is structured in seven chapters plus an appendix. The authors have a clear and concise 
writing style reflective of many years of academic writing experience. Chapter 1 provides an introduction to the 
monograph's purpose and objectives as well as an overview of the authors' organizing framework for subjective 
decision making in management accounting. The organizing framework consists of four categories of subjective 
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decision making in management accounting and two emerging themes. The authors argue that there is robust 
evidence consistent with psychological theory underlying the four categories, while there is some interesting, but 
perhaps less convincing (or at least less well developed), evidence to support the two emerging themes. 

Chapter 2 is meant to describe the methodology used by the authors to develop their categorization scheme. 
The most important information which the reader needs to understand, namely the methodology, is unfortunately 
relegated to the appendix and the last paragraph of Chapter 7. The authors instead devote much of Chapter 2 to an 
argument in favor of cognitive task analysis (1.е., a description and analysis of the knowledge and cognitive 
activities required for high performance on specific tasks) in management accounting. The authors argue that we 
have not made a point as a discipline to perform detailed cognitive task analyses of the type reported in the audit 
literature. Consequently, any framework for understanding how psychology affects management accounting deci- 
sions must be organized in an alternate form. Although this issue seems somewhat beside the point in the context 
of the chapter material, the authors’ argument concerning the importance of cognitive task analysis (or lack 
thereof) in management accounting is well taken. The field would benefit from a knowledge of the detailed 
cognitive processes involved in subjective decision making in management accounting. 

At this point, the reader should be aware of the importance of the appendix in order to obtain a good 
understanding of the authors’ methodology. The appendix walks the reader through two examples, one decision- 
facilitating and one decision-influencing, of the use of management accounting information in subjective decision 
making. The decision-facilitating example is based on the “grand model” of product capacity planning and pricing 
developed by Balakrishnan and Sivaramakrishnan (2002). The decision-influencing example references the work 
of Feltham and Xie (1994), who model the weights placed on performance measures in a multi-measure perfor- 
mance evaluation scheme. In both cases, the authors point out the many individual (subjective) decisions which 
managers must make in the course of capacity planning and performance evaluation. The implication is that the 
authors grouped the various individual decisions made in the process of capacity planning and performance 
evaluation into the categories listed in Chapter 2. 

While the examples provided in the appendix are somewhat informative, the link between the examples and 
the categories derived from them is rather indirect. The authors are not particularly forthcoming about how the 
appendix maps onto the categories; consequently, the reader must infer the mapping. While this would not be too 
difficult for experienced management accounting scholars, doctoral students or non-management accounting ге- 
searchers would have a more difficult time understanding how the lists provided in the appendix distill down to the 
four categories identified by the authars in developing their framework, potentially limiting its usefulness. 

Chapters 3-6 are devoted to discussions of each of the four subjective decision-making categories. For each 
category, the authors describe the important elements of the decision process and identify relatively recent studies 
that examine the type of decision under consideration. The authors do a nice job of summarizing each paper and 
its potential contribution as well as haw later papers build on earlier papers to fill the gaps in our understanding. 
Chapter 3 examines subjective valuation of management accounting-related monetary payoffs, linking these 
mainly to cognitive psychology, including framing and prospect theory. Chapter 4 examines subjective valuation 
of non-monetary payoffs, including fairness, honesty, trust, and reciprocity, mainly supported by theories of social 
and relational exchange drawn from social psychology. Chapter 5 examines the relatively simple subjective 
decision models used to perform many management accounting-related tasks (e.g., subjective linear programming, 
mental models, non-compensatory decision models, effects of opportunity, and sunk costs), while Chapter 6 
examines models of parameter and variable acquisition and subjective estimation (e.g., subjective estimation of 
model parameters using accounting information, effect of accounting knowledge, and experience on this subjective 
estimation). Each of Chapters 3-6 also includes an exhibit indicating the link between causes (i.e., management 
accounting independent variables) and effects (i.e., management accounting dependent variables) through particu- 
lar psychological theories. In addition, the exhibits indicate which of the management accounting studies reviewed 
in the chapter address the links identified in the exhibit, thus providing nice visual summaries of the main points 
raised in each of these chapters. 

After reading these chapters, I wondered how the particular illustrative articles were selected by the authors 
for inclusion in the monograph. Most of the illustrative articles were published since 2000, and all were published 
in top-tier accounting journals. The list of articles included in the monograph is by no means exhaustive, nor do 
I believe the authors meant it to be. The authors justify their focus on recent work because it reflects the "growth 
and dynamism of psychology-based research" on management accounting (p. 124). This seems reasonable. In 
addition, 1 suspect the authors selected the papers based on their extensive collective experience teaching and 
researching subjective decision making in management accounting. 

Finally, Chapter 7 discusses the two identified emerging themes to motivate future work. The emerging 
themes are (1) that two or three distinctly different subjective models account for most management accounting 
decisions, and (2) the important differences between intuitive or "gut-feel" decision making (or, as psychologists 
have labeled them, "System 1 cognitions”) and deliberative or consciously controlled decision making (i.e., 
“System 2 cognitions”) and implications of these for management accounting. According to the authors, these 
areas are ripe for future research. In addition, the authors identify several important, but relatively unexplored, 
research questions that could benefit from the application of insights from psychology, including: 
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* Why not study larger, more complex decision models that better reflect real-world decision making? 

* What drives selective information acquisition, given that modern information systems can provide so 

many different pieces of information? 

• Why don’t we see more non-laboratory studies similar to Ittner et al. (2003) to examine management 

accounting subjective decision making in real-world settings? 

In conclusion, I believe this book provides a good primer for scholars new to management accounting 
behavioral research on subjective decision making. For scholars already working in the field, the authors provide 
an organizing framework for thinking about the vast array of work in this area. This allows us to figure out where 
our research fits within the current literature and to pinpoint areas where future work is needed to move the 
literature ahead. 

»For many years, we have talked about integrating economics and psychology-based research in management 
accounting. Because of the increased level of interest in behavioral economics and behavioral finance, many 
economics-based researchers are becoming more interested in identifying behavioral theories that would allow 
them to gain additional insight into their research questions of interest. My hope is that this increased interest leads 
to fruitful partnerships between economics- and psychology-based researchers in management accounting. In my 
opinion, this monograph provides a much-needed tool to inform economics-based researchers of the "state of the 
art" in subjective decision-making research in management accounting. Consequently, this monograph should be 
helpful in further encouraging such partnerships. 
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CHRIS POULLAOS and SUKI SIAN (editors), Accountancy and Empire: The British 
Legacy of Professional Organization (New York, NY: Routledge, 2010, ISBN 978-0- 
415-45771-2, pp. xv, 263). 


This book discusses the historical development of the accounting profession in different parts of the British 
Empire. Its objective is to address a gap in the literature "by tracing the rise of professional accountancy in 
selected colonial outposts and examining the role of the institutions regulating it, within the context of the empire 
experience" (p. 2). It includes contributions of expert academics, some of them having published extensively on 
the development of accounting professions in different contexts, others focusing their research on the region that 
they discuss in their contribution to this work. Some of the chapters build upon previously published work, 
whereas others try to explore new aspects of the history of accountancy. Some of the countries that are discussed 
in this book are also found in Poullaos (2009), which can serve as a more general introduction to the history of 
accountancy as a profession. 

Although the different chapters clearly focus on the same type of issues, their development is rather variable. 
This is partly a consequence of the different historical methodologies on which they are based. In Chapter 2 (on 
South Africa, Australia, and Canada), Chris Poullaos has relied on an extensive set of primary sources from, e.g., 
the ICAEW archives, the National Archives, and the Scottish Records Office, It is the only contribution that relies 
heavily on unpublished archival sources. Suki Sian (Chapter 10, on Kenya) and Shraddha Verma (Chapter 9, on 
" India) use a mixture of primary and secondary sources. Chibuike 17, Uche (Chapter 4, on Nigeria) refers also to 
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email correspondence with representatives of professional bodies. The contribution of Devi S. Susela (Chapter 5, 
on Malaysia) is based on 65 interviews, whereas P. W. Senarath Yapa (Chapter 6, on Sri Lanka) uses information 
from nine unstructured interviews. The other contributions rely mainly on secondary sources. 

The book covers a range of countries, including the self-governing (white) settler colonies, South Africa, 
Australia, and Canada, and non-settler colonies such as Nigeria, Malaysia, Sri Lanka, Jamaica, Trinidad and 
Tobago, India, and Kenya. Consequently, it covers most regions of the British Empire, the only notable exception 
being what could be considered the Arab part of the Empire. Although the current selection of former colonies 
provides contrasting examples of the uptake of the British models, it would be interesting to study whether the 
situation in protectorates such as British Somaliland, Yemen, and Oman was any different from that in the other 
paris of the Empire. 

The subtitle of the book, “The British Legacy of Professional Organization,” is highly relevant, as it clarifies 
that the focus of the contributions is on the organization of accountancy in occupational groupings, not on the 
development of accounting and auditing practices. Only in some chapters is attention given to the development of 
individual audit firms, most notably in the chapter on Trinidad and Tobago. All chapters provide some background 
on the countries involved. For most of them, the information provided is sufficient to understand the issues at 
stake, but in some chapters more circumstantial information would make the reading easier. Most chapters center 
strongly on the accounting profession, and one wonders whether similar discussions and developments could be 
found in other professions. 

In most of the chapters, the key issue is the development of a local accounting profession and the relation 
between British accountancy bodies and overseas associations. An overall impression after reading the book is that 
this was to a large extent a one-way relation, with, in a first stage, British accountants being “exported” to the 
different parts of the Empire and, in a later phase, locally trained accountants who organized themselves, very 
often following British models and with direct or indirect interventions by the U.K. bodies. Even the competition 
that existed between the U.K. bodies in their home market had a clear impact on developments overseas. This is 
also apparent in the concluding section, where Poullaos and Sian address how the organization of accountants in 
different parts of the empire has been affected by the empire experience. This section also focuses on the relation 
between British capital and the development of colonial professional organization. It explains how South Africa, 
Australia, and Canada were “decolonized” earlier than the other colonies in terms of formal control over accoun- 
tancy. In many of the non-settler colonies, local legislation confirmed the continued impact of British reporting, 
auditing, and professional requirements. It led to professional exclusion processes both before and after political 
independence. In these countries, local accountants typically could organize themselves only after decolonization. 
The closing section also discusses how the empire experience had an impact on the organization of accountancy 
in Britain. However, as the different chapters show, this impact seems limited: contrary to the many British 
expatriate accountants who went to, for example, Kenya, the opposite flow of trainees was very small. An 
interesting extension of the current research might focus on analyzing quantitative data on, e.g., the number of 
non-British citizens who were members of the Institute of Chartered Accountants in England and Wales and other 
U.K. bodies (see also Briston and Kedslie 1997). Some chapters give the impression that they were very rare (e.g., 
the chapter on Nigeria), but others indicate that there were “many locals” who went to the U.K. for professional 
qualification (e.g., the chapter on Malaysia). 

Apart from the introductory and concluding chapters, each chapter focuses on a group of related colonies or 
dominions or on an individual country. Poullaos discusses the evolution of the imperial accountancy arena during 
the 1920s, focusing on South Africa, Australia, and Canada. He extends the earlier work of Chua and Poullaos 
(2002) on the interactions between British accountants and their associations on the one hand, and dominion 
accountants on the other hand. Issues at stake include the use of the “chartered accountant” designation, access by 
British accountants to dominion markets and of dominion accountants to British designations, and participation in 
British examinations without having to travel to Britain. The chapter provides a quite detailed account of initia- 
tives, events, and negotiations and demonstrates how, by 1930, South African, Canadian, and Australian accoun- 
tants in different ways had strengthened their position vis-à-vis the British profession. 

Canada is also the subject of the second chapter, but Alan J. Richardson draws attention to the fact that the 
British Empire was not the only imperial power that had an impact on the development of accountancy in Canada. 
He argues that the Canadian profession was shaped by French, British, and American institutional legacies and that 
the impact of each of these changed over time. These competing influences still exist today, and their inconsis- 
tencies allowed Canada to go its own way. Changing professional designations (chartered accountants, certified 
public accountants ...) exemplify different paths in the development of the profession. Although, overall, the 
American influence increased, this influence seems to have been more on the standard-setting processes than on 
the development of professional structures. The chapter ends with a number of examples in which Richardson 
shows Canada's position as a "middle power" with a distinct point of view.on many issues. 

The chapter on Jamaica makes it clear that the power struggle between the U.K. institutes was exported to the 
colonies. This is most obvious in the opposition between the Association of Certified Corporate Accountants (as 
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the ACCA was then known) and the other bodies. Different professional bodies coexisted, some linked with U.K. 
bodies, others being led by indigenous accountants. Again, professional designations were important, with con- 
flicts between “chartered” and "certified" accountants that were often related to attempts to become more inde- 
pendent of the British accounting bodies. Specific issues that are discussed in the chapter are the attempt to create 
an accounting degree at the University of the West Indies that would serve as an entry qualification for the 
profession, and the entry of the American Institute of Certified Public Accountants. The ACCA played a similar 
role in Trinidad and Tobago (T&T). Marcia Annisette’s contribution focuses on the evolution of the practice link 
between Britain and T&T. She starts from the dominance of the large multinational audit firms in the 1990s and 
traces their history in T&T. There were two different phases in the internationalization of accounting practice, the 
first one linked with colonialism and British accountancy, the second one in the 1960s linked with U.S.-based 
multinational practices. This increasing impact of international networks should be seen in a context of growing 
international standardization of accountancy practices. 

There are two contributions on African countries, in which Uche discusses Nigeria and Sian treats Kenya. 
Uche observes that, around the mid 20th century, Nigeria’s export trade was dominated by European firms and that 
expatriate (U.K.) accountants were the only ones present in the country. This colonial influence still continues and 
impedes training second-tier accountants who could provide services for smaller local business. Only with the 
coming of independence did Nigerians become qualified as accountants. It is interesting to note that, again, the 
ACCA played a major role in this, probably due to the structure of its training programs. Uche reports in detail the 
continuing impact of the ACCA and its interactions with the formation of local professional bodies. In a similar 
way, Kenya could not break radically with its colonial past following independence in 1963. Its situation was quite 
comparable to that of Nigeria, a difference being а stronger economic influence of Asians in the former. Sian 
suggests that the creation of the Association of Accountants in East Africa, acting as an interest group for British 
accountants, might have been an attempt to close off the market for non-whites. After independence, movements 
toward an indigenous profession remained influenced by British concepts of professionalism and British imperial 
history, although eventually an autonomous profession emerged. 

Asian countries included are Malaysia, Sri Lanka, and India. Also in these countries, the continuing impact 
of British accountants and resistance against this influence had a profound impact on the development of accoun- 
tancy. The development of the profession in Malaysia is characterized by the complex relation between the 
Malaysian Association of Certified Public Accountants, the Malaysian Institute of Accountants (a government 
body), and British professional bodies. This relation seems quite interesting, and it shows some specific charac- 
teristics. One of these is that Malaysia was not allowed to use the CA designation, a point that is not fully 
explained in the chapter. The situation in Sri Lanka seems at first rather comparable, but then again different. 
Competition there was mainly between the Institute of Chartered Accountants of Sri Lanka (ICASL, having 
members with British qualifications) and the Registered Accountants (local graduates, considered to be the lower 
tier of the profession), The ICASL managed to reserve the public practice market to a limited number of elite 
accountants, a closure that it could maintain until 1977. The Indian professional model was also based on the 
British model. However, interventions by the Indian government were significant, and they resulted in changes in 
the regulation of the profession. The chapter provides an interesting summary of this interaction between state and 
profession. Also here, the use of the “chartered” designation was an issue and seems linked with the social 
stratification within the profession. 

The issue of the designation of accounting professionals can perhaps be seen as а red thread throughout the 
book. Many debates on the relation between British accountants, British accountancy bodies, and local profes- 
sionals seem to translate into claims on the “Chartered Accountant” designation that apparently served as a quality 
label (see also Parker 2005). 

In conclusion, this book provides an interesting overview of how professional structures and concepts that 
were introduced under the rule of the British Empire continued to have an impact after the independence of the 
countries involved. For scholars who want to understand the current status and organization of the accounting 
profession in these countries, this book is essential reading. Generally speaking, the way these professions are 
organized is shaped both by a continuing colonial legacy and by the way local governments and professionals have 
interacted with these colonial remains in order to create an autonomous profession of indigenous accountants. 
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Footnotes: Footnotes are not used for documentation. Textual footnotes should be used only for extensions 
and useful excursions of information that if included in the body of the text might disrupt its continuity. 
Footnotes should be inserted using the “footnote” or “endnote” feature of the word processing software which 
will automatically number the footnotes throughout the manuscript with superscript Arabic numerals. 


SUBMISSION OF MANUSCRIPTS 
Authors should note the following guidelines for submitting manuscripts: 


1. Manuscripts currently under consideration by another journal or publisher should not be submitted. The 
author must state upon submission that the work is not submitted or published elsewhere. 

2. For manuscripts reporting on field surveys or experiments: If the additional documentation (e.g., ques- 
tionnaire, case, interview schedule) is sent as a separate file, then all information that might identify the 
author(s) must be deleted from the instrument. 

3. Through May 31, 2011, manuscripts should be submitted via email as Microsoft Word or PDF file to the 
Senior Editor, Steven Kachelmeier, at email address tar@mccombs.utexas.edu. Please submit separate 
files for (1) the manuscript’s title page with identifying information (not forwarded to reviewers), (2) the 
manuscript with title page and all other identifying information removed, and (3) any necessary supple- 
mental files, such as experimental instructions and/or response memoranda on invited revisions. The body 
or subject line of the email should also indicate the submission fee receipt code from AAA (see step 5 
below). 

4. Beginning June 1, 2011, please check online at https://aaahq.org/A A Aforms/journals/tarsubmit.cfm for 
instructions on how to submit a new manuscript to the incoming editor, John Harry Evans III. Revisions 
of manuscripts originally submitted before June 1, 2011 should be sent via email to the incoming editor, 
John Harry Evans Ш, at tar katz.pitt.edu. 

5. A nonrefundable submission fee is required unless explicitly waived by the editor for invited revisions of 
previous submissions. The submission fee in U.S. funds is $200.00 for members and $400.00 for non- 
members of the AAA payable by credit card (VISA or MasterCard only). The payment form is available 
online at: https://aaahq.org/AA Aforms/journals/tarsubmit.cfm. If you are unable to pay by credit card or 
have any questions, please contact the AAA Member Services Team at (941) 921-7747 or info aaahq.org. 

6. Revisions must be submitted within 12 months from the decision letter inviting a revision. 


COMMENTS 


The Accounting Review welcomes submissions of comments on previous ТАК articles. Comments on articles 
previously published in The Accounting Review will generally be reviewed by two reviewers, usually includ- 
ing an author of the original article (to assist the editor in evaluating whether the submitted comment 
represents the prior article accurately) and an additional reviewer who is independent of the original article. 
If a comment is accepted for publication, the original author will generally be invited to reply. АП other 
editorial requirements, as enumerated above, apply to proposed comments. 


POLICY ON REPRODUCTION 


An objective of The Accounting Review is to promote the wide dissemination of the results of systematic 
scholarly inquiries into the broad field of accounting. 

Permission is hereby granted to reproduce any of the contents of the Review for use in courses of 
instruction, as long as the source and American Accounting Association copyright are indicated in any such 
reproductions. 

Written application must be made to the American Accounting Association, 5717 Bessie Drive, Sarasota, 
FL 34233-2399, for permission to reproduce any of the contents of the Review for use in other than courses 
of instruction—e.g., inclusion in books of readings or in any other publications intended for general distri- 
bution. In consideration for the grant of permission by the Review in such instances, the applicant must notify 
the author(s) in writing of the intended use to be made of each reproduction. 

Except where otherwise noted in articles, the copyright interest has been transferred to the American 
Accounting Association. Where the author(s) has (have) not transferred the copyright tc the Association, 
applicants must seek permission to reproduce (for all purposes) directly from the author(s). 
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POLICY ON DATA AVAILABILITY 


The AAA’s Executive Committee policy (originally adopted in 1989, and amended in 2009) is that the 
objective of the Association-wide journals (The Accounting Review, Accounting Horizons, Issues in Account- 
ing Education) is to provide the widest possible dissemination of knowledge based on systematic scholarly 
inquiries into accounting as a field of professional research, and educational activity. To fulfill this objective, 
authors are encouraged to make their data available for use by others in extending or replicating results 

. reported in their articles. 
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MAY 2011 PLACEMENT ADS 


The deadline for free position ads to be included in this section is two months prior 
to the date of publication. The Accounting Review is published in January, March, 
May, July, September, and November. Position ads in the “Placement Ads” are free 
with the purchase of a job posting in the AAA Career Center. For more information 
on how to purchase a job posting in the Career Center and receive your free position 
ad in The Accounting Review, go to the AAA website at http://aaahq.org and click on 
“Career Center,” or call our office at 941-921-7747. 


THE UNIVERSITY OF GEORGIA, Department of Housing and Consumer Economics College of 
Family and Consumer Sciences is seeking candidates for an Endowed Professor in Family Financial 
Planning with an emphasis in estate planning, planned giving, or comprehensive financial planning. 
Nine-month academic appointment. An endowed professorship is associated with this position. 
Positicn begins August 1, 2011. Qualifications: Significant academic contributions to the field of 
financial planning. Demonstrated record of outstanding scholarly achievement in research produc- 
tivity, external funding, and successful college teaching experience desired. Current academic rank 
should be that of Professor, or work should be commensurate with the rank of Professor. For a 
complete position description and more information, please visit our website at http:// 
www.Lr.uga.edu/hace-prof.pdf. Application Procedure: To receive full consideration, applications 
should be submitted prior to April 3, 2011; however, the position will remain open until filled. 
Application materials should include a cover letter of interest, which outlines relevant experience, 
supports your fit for the position, as well as a current curriculum vitae. Email submissions with 
attachments are preferred. Please send to: executivesearch @uga.edu. Confidential requests for in- 
formation should be directed to: Executive Michael Luthi, Search Consultant, Executive and Faculty 
Search Group, (706) 542-1837. The University of Georgia is an Equal Opportunity/Affirmative 
Action Institution. 


THE UNIVERSITY OF ALABAMA, Culverhouse School of Accountancy invites applications for a 
tenure-track appointment at the Assistant Professor level beginning August 16, 2011. Candidates 
should have an earned doctorate (or expect to complete their doctorate by August 2011) and exhibit 
excellence in teaching and the capacity for research publishable in top-tier accounting journals. The 
faculty member can expect to teach undergraduate- and graduate-level accounting courses, although 
current needs are driven by increasing graduate enrollment. The faculty member will have the ability 
to work with doctoral students and engage in University, College, and Departmental service activi- 
ties. We welcome applications in all areas of accounting research and teaching. Screening of can- 
didates will begin immediately and will continue until the position is filled. Salary is competitive. 
Candidates selected for interview will be required to submit a disclosure and consent form autho- 
rizing a background investigation. Only online applications are accepted. Candidates must apply 
online at: https://facultyjobs.ua.edu. Required documents include: resume/curriculum vitae, disser- 
tation proposal and/or example of scholarly wark, and a listing of four references. The University of 
Alabama is an Equal Opportunity Educational Institution/Employer. Women and Minorities are 
encouraged to apply. | 
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CENTRAL CONNECTICUT STATE UNIVERSITY, Accounting Department invites applications 
for a tenure-track Assistant/Associate Professor (#C11-058) beginning Fall 2011. Teaching interest 
in financial accounting and/or auditing is preferred. Candidates must have a commitment to excel- 
lence in teaching and scholarship. Qualifications: Earned doctorate in accounting or equivalent from 
an accredited school preferred, ABD with a 2011 expected completion date required. Professional 
certifications and/or experience are desirable. Teaching interest in financial accounting and/or andit- 
ing is preferred. Candidates must have a commitment to excellence in teaching and scholarship. To 
apply: Complete the Application Cover Page at: http://w www.ccsu.edu/AffAction/. Send the cover 
page, along with a letter of application, your resume, and the names, addresses, and telephone 
numbers of three references via email to: Professor Lawrence Grasso, GrassoLa@ccsu.edu. Appli- 
cations received by February 15, 2011 will receive preference of review. CCSU aggressively pursues 
a program of equal employment and educational opportunity and affirmative action. Members of all 
underrepresented groups, women, veterans, and persons with disabilities are invited and encouraged 
to apply. 


UNIVERSITY OF SOUTH CAROLINA, School of Accounting is accepting applications for a 
tenure-track position beginning Fall 2011. Candidates should have primary teaching and research 
interests in international accounting, and must possess a doctorate or expect to complete it by August 
2011. In addition to being committed to excellent teaching, candidates must be ambitious and 
motivated researchers who are committed to publishing in top-tier accounting journals. Salaries are 
competitive and commensurate with experience and achievements. All faculty searches are subject 
to the availability of funding. Submit curriculum vitae, working papers, and defended dissertation 
proposal with data to: Professor Al Leitch, Darla Moore School of Business, University of South 
Carolina, Columbia, SC 29208; or Email: leitch@moore.sc.edu, The University of South Carolina is 
an AA/EOE. Minorities and women are strongly encouraged to apply. 


SAINT LOUIS UNIVERSITY, a Catholic, Jesuit institution dedicated to student learning, research, 
health care, and service, is seeking applicants for a tenure-track position in Accounting at the Senior 
Assistant/Associate/Full Professor level, beginning Fall 2011. Candidate must demonstrate a com- 
mitment to teaching excellence and quality research and publication. Service to the department and 
University are also required in addition to teaching and scholarship. Professional certification is 
desirable. Teaching interest is open and all areas will be considered except financial accounting. A 
Ph.D. in Accounting is required. Dissertation-stage candidates could be considered under excep- 
tional circumstances. Compensation package is competitive. Interested candidates must apply online 
at http://jobs.slu.edu (Req. 20100775), and submit a cover letter and current curriculum vitae. All 
other correspondence regarding this position can be sent to: Dr. Ananth Seetharaman, Chair of 
Accounting, John Cook School of Business, Saint Louis University, 3674 Lindell Boulevard, St. 
Louis, MO 63108. Saint Louis University is an Affirmative Action/Equal Opportunity Employer, and 
encourages nominations of and application from women and minorities. 


SABANCI UNIVERSITY (SU), School of Management invites applications for full-time/visiting 
positions in Accounting for the year 2011-2012. While pre-doctorate candidates are considered, the 
ideal candidate possesses a Ph.D. and a track record of excellence in research and teaching. SU, a 
private, innovative academic institution, offers excellent facilities and a light teaching load amid a 
modern campus located in Istanbul, Turkey. The University admits top-ranking faculty and students 
and offers undergraduate, М.В.А., Executive М.В.А. and Ph.D. programs in Operations Manage- 
ment, Organization Studies, and Finance. It provides faculty excellent support and competitive 
compensation, including housing. English is the medium of instruction. For further information, 
visit: http://www.sabanciuniv.edu/ybf/eng/. Applicants are requested to complete the online applica- 
tion form at: https://crm.sabanciuniv.edu/acadcv/application.php, using ACC2011 as the Code Ref- 
erence, and to send their cv and the names of three references to: Ulf Nilsson, 
nilsson@sabanciuniv.edu or Mine Aksu, maksu@sabanciuniv.edu. We will be interviewing at the 
Annual Meeting. 
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QUINNIPIAC UNIVERSITY invites applications for a full-time, tenure-track accounting faculty 
positioa, rank open, to start in August 2011. Consideration will be given to mdividuals interested in 
a full-tme visiting position for the 2011-12 academic year. Primary teaching responsibilities will be 
in the financial accounting area. All candidates must hold a doctorate in accounting from an 
AACSB-accredited school (ABD considered for the visiting position). Successful candidates must 
demonstrate a strong potential for or have an established record of excellent teaching, published 
research, and significant service. Qualified candidates should apply online at https:// 
careers.quinnipiac.edu. Application materials should include a curriculum vitae, evidence of teach- 
ing success, and the names of three гегегепсез. Other inquiries should be directed to: Kathleen 
Simions, Chair of the Accounting Department at: accountingsearch @quinnipiac.edu. Quinnipiac 
University has a strong commitment to the principles and practices of diversity throughout the 
University community and welcomes candidates who would enhance that diversity. 


OREGON STATE UNIVERSITY invites applicants for two anticipated full-time (1.0 FTE), nine- 
month, tenure-track positions in Accounting at the rank of Assistant Professor to begin September 
2011. Teaching responsibilities include graduate and undergraduate accounting courses. Scholarly 
activities are expected to result in peer-reviewed journal publications. Required qualifications in- 
clude an appropriate earned doctorate from an AACSB International accredited university or equiva- 
lent and evidence of potential for scholarly publication and teaching effectiveness. Preferred quali- 
fications include professional certification and a demonstrable commitment to promoting and 
enhancing diversity. Information about the position and the University’s online application process 
can be accessed at: http://oregonstate.edu/jobs, Posting No. 006857. Applications, including (1) a 
letter of interest clearly indicating interest in the position, (2) cv, (3) contact information for three 
professional references, (4) recent teaching evaluations and summary information of courses taught, 
and (5) evidence of scholarly work should be submitted by 3/15/2011. OSU is an AA/EOE. 


TOWSON UNIVERSITY, College of Business and Economics invites applications for the position 
of Chair of the Department of Accounting, effective Summer or Fall 2011. The College of Business 
and Eccnomics is an AACSB-accredited institution with more than 3,200 undergraduate business 
majors. The Department of Accounting has 18 full-time faculty members, more than 600 under- 
graduate majors, and more than 60 graduate students in a joint master's program in accounting with 
the University of Baltimore. Additional information can be found at: http://wwwnew.towson.edu/ 
cbe/. Applicants should possess a Ph.D./D.B.A. with a concentration in accounting or closely related 
field from an AACSB-accredited university, and must provide evidence of effective teaching and 
intellectual contributions. Experience with the AACSB-accreditation process and requirements are 
highly desirable. This is a renewable 12-month administrative appointment at the Clinical Associate 
Professor level for an expected initial period of five years, with a reduced teaching load. Compen- 
sation is competitive. Review of applications will begin immediately. Interested candidates should 
submit a letter of application, current curriculum vitae, recent teaching evaluations (if applicable), 
and the names of three references to: Dr. Andrew Schiff, Search Committee Chair, Towson Univer- 
sity, Department of Accounting, Stephens Hall, Room 102, 8000 York Road, Towson, MD 21252- 
0001, or Email: aschiff@towson.edu. Towson University is an Equal Opportunity/Affirmative Ac- 
tion Employer and has a strong institutional commitment to diversity. Women, minorities, persons 
with disabilities, and veterans are encouraged to apply. 
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WEST VIRGINIA UNIVERSITY, Department of Accounting and MIS in the College of Business 
and Economics invites applications for a tenure-track appointment at the Assistant or Associate 
Professor level beginning in August 2011. Our primary need is in Accounting Information Systems, 
but we welcome applications in all areas of accounting research and teaching. We anticipate filling 
at least one position. West Virginia University is a land-grant institution located in Morgantown, 
West Virginia, close to major metropolitan centers in the northeast. The Department of Accounting 
and MIS is separately accredited by the AACSB and offers undergraduate and graduate degrees, 
including a planned doctoral program. Applicants should have an earned doctorate in accounting (or 
equivalent) from an AACSB-accredited institution and exhibit effectiveness in teaching and the 
capacity for research publishable in respected scholarly outlets. CPA or other professional certifica- 
tion preferred. Interested candidates should submit a current vita, a research paper suitable for 
presentation at a workshop, the names and contact information of three references, and evidence of 
teaching ability to: Dr. Barbara Apostolou, Chair of the Search Committee, via email at: 
Barbara. Apostolou @ mail.wvu.edu. Screening of candidates will begin immediately and continue 
until the position is filled. West Virginia University is ап Equal Employment Opportunity/ 
Affirmative Action Institution. The university does not discriminate on the basis of age, color, 
disability, national origin, race, religion, sexual orientation, or veteran status. West Virginia Univer- 
sity is the recipient of an NSF ADVANCE award for gender equity. Minorities, persons with dis- 
abilities, females, and other protected class members are encouraged to apply. This offer of employ- 
ment is conditioned upon approval by all the appropriate governmental authorities. 


GEORGIA COLLEGE & STATE UNIVERSITY, J. Whitney Bunting College of Business is accept- 
ing applications for an academic-year tenure-track position in accounting beginning August 2011. 
The successful candidate will be able to teach introductory, upper-level, and graduate accounting 
courses. Preference will be given to candidates with a record of successful teaching in the area of 
financial accounting. Qualified candidates will have completed, or be near completion, of a Ph.D. or 


D.B.A. degree in Accounting from a business school with AACSB Accreditation. Professional ex- 
perience and certification are considered very positive complements to academic qualifications. 
Excellence in teaching, scholarly activity, and university/community service are requirements for 
promotion and tenure. То be considered, applicants must apply electronically at http:// 
www.gcsujobs.com 


ROBERT MORRIS UNIVERSITY (RMU), founded in 1921, has offered high-quality academic 
programs in business to students from the Pittsburgh region. RMU's School of Business, accredited 
by AACSB International, seeks qualified candidates with teaching interests in financial/managerial/ 
cost accounting or accounting information systems. Qualified candidates should have an earned 
doctorate in Accounting from an AACSB-accredited school or be ABD near completion. Candidates 
must demonstrate excellence in teaching and research in their disciplines. Candidates should submit 
a letter of interest, current vita, and three academic and/or professional references to: 
Business Grmu.edu, c/o Dr. Jacobs, Dean, or mail to: Robert Morris University, 6001 University 
Boulevard, Moon Township, РА 15108, c/o Dr. Jacobs. Visit http:/Avww.rmu.jobs for position de- 
scriptions. Anticipated start date is Fall 2011 and dependent upon budgetary approval. RMU 18 
committed to increasing diversity in its community and actively pursues individuals from all back- 
grounds. EOE 
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PURDUE UNIVERSITY, Krannert Greduate School of Management invites applicants for one ог 
more Continuous Term Lecturer appoin-mentis) in accounting beginning Fell 2011. Candidates will 
be expected to teach both Introduction to Tax and Advanced Taxation and be capable of teaching 
both financial and managerial accounting, primarily at the undergraduate level. A full-time teaching 
load is typically 24 credit hours per year. A successful candidate will have а demonstrated record of 
excellence in teaching preferably at comparable schools. A minimum of a master’s degree with a 
professional certification in accounting is required; a Ph.D. with a concentration in accounting or 
related area is strongly preferred. Applications will be reviewed beginning February 15, 2011 and 
should include a vitae, evidence of teaching performance, and three letters of reference. Applications 
should be sent to: Professor Susan Wat:s, Krannert Graduate School of Management, Purdue Uni- 
versity, 403 W. State Street, West Lafayetie, IN 47907-2056. Purdue University is an Equal 
Opportunity/Equal Access/Affirmative Action Employer fully committed to achieving a diverse 
workforce. 


UNIVERSITY OF WISCONSIN-STEVENS POINT is seeking candidates for an Assistant Profes- 
sor of Accounting. The successful candidate will teach introductory and advanced courses in Ac- 
counting, with preference given to an interest in Auditing. Prefer Ph.D/D.B.A. in accounting from 
ап AACSB-accredited institution. ABD will be considered. Ph.D. required for tenure. Send a letter 
of application including primary areas of teaching interest, resume or vita, copies of transcripts, and 
a list of three references (including phone numbers) to: Dr. Gary Mullins, Head School of Business 
and Economics, University of Wisconsin-Stevens Point, Stevens Point, WI 54481-3897; Email: 
gmullins@uwsp.edu. Screening will begin February 22, 2011. Position open until filled. Please see 
full position announcement at: http://www.UWSP.edu. | 


UNIVERSITY OF WASHINGTON BOTHELL, Business Program invites applications for a tenured 


opening in Accounting at the Professor or Associate Professor level. The appointment will become 
effective in the Fall 2011. Applicants must have a Ph.D. in Accounting and a commitment to 
excellence in scholarship and teaching. University of Washington faculty members engage in re- 
search, teaching, and service. Candidates should expect to teach at all levels of the undergraduate 
and graduate curriculum. They must be zommitted to working with diverse student and community 
populations. This position is contingent upon available funding. The successful candidate will have 
an established record of publications in high-quality academic journals and demonstrated excellence 
in teaching. In addition, he or she will play an important role in the continued development of the 
recently launched Business Program Accounting Option and new initiatives in the accounting area. 
This will include program and curriculum development, scheduling, mentoring junior faculty, and 
hiring part-time faculty. To apply: Please send a letter describing your qualifications and interest in 
the position as well as your curriculum vitae to: Dr. Steve Holland Chair, Accounting Search 
Committee, Business Program, Box 358533, University of Washington Bothell, 18115 Campus Way 
NE, Bothell, WA 98911-8246; Phone: (425) 352-5232; Fax: (425) 352-5277; Email: 
sholland@uwb.edu. For additional information, please see our website at: http://www.uwb.edu/bus/. 


TEXAS CHRISTIAN UNIVERSITY, M. J. Neeley School of Business, Department of Accounting 
invites applications for a second Assistant Professor position to begin in August 2011. Candidates 
should hold a doctoral degree or expect to complete their degree prior to the start date. We seek 
candidates who are both excellent teachers and excellent researchers, consistent with TCU’s 
Teacher-Scholar model. Applications should include a cover letter, curriculum vitae, and letters from 
at least three references. Applications may be submitted in either hardcopy or electronically, al- 
though electronic copies are preferred. Please send applications or nominations to: Ray Pfeiffer, 
Department Chair & Professor of Accounting, Texas Christian University, M. J. Neeley School of 
Business, TCU Box 298530, Fort Worth, TX 76129: Phone: (817) 257-6435: Email: 
rpfeiffer@tcu.edu. Review of applications will continue until the position is filled. EEO/AA. 
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ProQuest Accounting & Tax includes: 







* Highly ranked global and scholarly journals, * Tax law and financial law pub ications from noted 
including all journais from the American sources such as the American Bar Association 
Accounting Association and American Law Institute 

* Accounting, tax management and auditing trade * Global accounting and tax journals from the UK, 
publications from top publishers such as CCH, Canada, Australia, Ргагсе, Spain, Ireland, and more 






Aspen, Thomson Professional and Regulatory 
Services, Source Media, Tax Management пс, 
and Euromoney 






* Financial advising, governance, and nsk 
management publications from Incisive Media, 
CFA Institute, and Association for Financial 

* AICPA publications Counseling and Pla ning 













5 International Federation of Accountants (FAC) * Accounting history from publications such as 
standards The Accounting Historian's Journal 








* Tax law reviews from top insti-utions such as * Over 1,500 accounting dissertations and more 
New York University, Stanford Vande and than 7800 working papers from SSRN, plus 
University of Virginia select conference proceedings 


























ProQuest Accounting & Tax 
Visit WwW. proquest.com/go/accounting 


d sign up for a free trial 
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317 | MISSISSIPPI STATE UNIVERSITY-MERIDIAN, Division of Business, is accepting applications 
for the position of Assistant Professor of Accounting. Teaching emphasis will be in managerial/ 
financial accounting. This is a tenure-track position beginning August 16, 2011. Minimum qualifi- 
cations: Doctorate in Accounting from an AACSB-accredited institution or Doctorate in Business 
Administration with an emphasis in Accounting from an AACSB-accredited institution with dem- 
onstrated excellence in the field of accountancy. Preferred qualifications: Professional certification 
and practical experience in accounting. Responsibilities: Teach a variety of undergraduate and 
graduate courses in accounting, publishing high-quality research, and advising and mentoring stu- 
dents. Review of applications will begin immediately and continue until position is filled. Interested 
applicants must complete the Personal Data Information Form online ас http:// 
www.jobs.msstate.edu, and send: (1) letter of application; (2) vitae, (3) official transcripts; (4) three 
letters of recommendation; and (5) the name, address, and telephone number of three additional 
references to: Dr. Kevin Ennis, Search Committee Chair, Division of Business, MSU-Meridian, 
1000 Highway 19 North, Meridian, MS 39307. MSU is an AA/EOE. 
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* Highly ranked global and scholarly journals, 
including all journals from the American 
Accounting Association 


Accounting, tax management, and auditing trade 
publications from top publishers such as CCH, 
Aspen, Thomson Professional and Requiacory 
Services, Source Media, Tax Management Inc, 
and Euromoney 


AICPA publications 


International Federation of Accountants (IFAC) 
standards 


Tax law reviews from top institutions such as 
New York University, Stanford, Vanderbilt, and 
University of Virginia 
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Tax law and financial law publications from noted 
sources such as the Americar Bar Association 
and American Law Institute 


Global accounting and tax journals from the UK, 
Canada, Australia, France, Spain, Ireland, and more 


Financial advising, governance, and risk 
management publications from Incisive Media, 
CFA institute, and Association for Financial 
Counseling and Planning 


Accounting history from publications such as 
The Accounting Historian's Journal 


Over 1,500 accounting dissertations and more 
than 7800 working papers from SSRN. plus 
select conference proceedings 
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